Efficient simulation system of quantum algorithm gates on classical computer based on fast algorithm

ABSTRACT

An efficient simulation system of quantum algorithm gates for classical computers with a Von Neumann architecture is described. In one embodiment, a Quantum Algorithm is solved using an algorithmic-based approach, wherein matrix elements of the quantum gate are calculated on demand. In one embodiment, a problem-oriented approach to implementing Grover&#39;s algorithm is provided with a termination condition determined by observation of Shannon minimum entropy. In one embodiment, a Quantum Control Algorithm is solved by using a reduced number of quantum operations.

BACKGROUND

1. Field of invention

The present invention relates to efficient simulation of quantum algorithms using classical computers with a Von Neumann architecture.

2. Description of the Related Art

Quantum algorithms (QA) hold great promise for solving many heretofore intractable problems where classical algorithms are inefficient. For example, quantum algorithms are particularly suited to factorization and/or searching problems where the computational complexity increases exponentially when using classical algorithms. Use of quantum algorithms on true quantum computers is, however, rare because there is currently no practical physical hardware implementation of a quantum computer. All quantum computers to date have been too primitive for practical use.

The difference between a classical algorithm and a QA lies in the way that the QA is coded in the structure of the quantum operators. The initial input to the QA is a quantum register loaded with a superposition of initial states. The output of the QA is a function of the problem being solved. In some sense, the QA is given a problem to analyze and the QA returns its qualitative property in quantitative form as an answer. Formally, the problems solved by a QA can be stated as follows:

-   -   Input: A function ƒ:         0,1         ^(n)→         0,1         ^(m)     -   Problem: Find a certain property of ƒ

Thus, the QA studies some qualitative properties of a function. The core of any QA is a set of unitary quantum operators or quantum gates. A quantum gate is a unitary matrix with a particular structure related to the algorithm needed to solve the given problem. The size of this matrix grows exponentially with the number of inputs, making it difficult to simulate a QA with more than 30-35 inputs on a classical computer with a Von Neumann architecture because of the memory required and the computational complexity of dealing with such a large matrix.

SUMMARY

The present invention solves these and other problems by providing an efficient simulation system of quantum algorithm gates and for classical Von Neumann computers. In one embodiment, a QA is solved using a matrix-based approach. In one embodiment, a QA is solved using an algorithmic-based approach wherein matrix elements of the quantum gate are calculated on demand. In one embodiment, a problem-oriented approach to implementing Grover's algorithm is provided with a termination condition determined by observation of Shannon entropy. In one embodiment, a QA is solved by using a reduced number of operators.

In one embodiment, at least some of the matrix elements of the QA gate are calculated as needed, thus avoiding the need to calculate and store the entire matrix. In this embodiment, the number of inputs that can be handled is affected by: (i) the exponential growth in the number of operations used to calculate the matrix elements; and (ii) the size of the state vector stored in the computer memory.

In one embodiment, the structure of the QA is used to provide an efficient algorithm. In Grover's QSA, the state vector always has one of the two different values: (i) one value corresponds to the probability amplitude of the answer; and (ii) the second value corresponds to the probability amplitude of the rest of the state vector. In one embodiment, two values are used to efficiently represent the floating-point numbers that simulate actual values of the probability amplitudes in the Grover's algorithm. For other QAs, more than two, but nevertheless a finite number of values will exist and such finiteness is used to provide an efficient algorithm.

In one embodiment, the QA is constructed or transformed such that entanglement and interference operators can by bypassed or simplified, and the result is computed based on superposition of the initial states (and deconstructive interference of final output patterns) representing the state of the designed schedule of control gains. In one embodiment, the Deutsch-Jozsa's algorithm, when entanglement is absent, is simulated by using pseudo-pure quantum states. In one embodiment, the Simon algorithm, when entanglement is absent, is simulated by using pseudo-pure quantum states. In one embodiment, an entanglement-free QA is used to optimize an intelligent control system.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows memory used versus the number of qubits in a MATLAB 6.0 simulation environment used for modeling quantum search algorithm.

FIG. 2 shows the time required to make a fixed number of iterations as a function of processor clock frequency on a computer with a Pentium III processor.

FIG. 3 shows a family of curves from FIG. 2 for 100 iterations.

FIGS. 4 a and 4 b show surface plots of the time required for a fixed number of iterations versus the number of qbits using processors of different internal frequency.

FIG. 5 shows a family of curves from FIG. 4 for 10 iterations.

FIG. 6 shows the time for one iteration of 11 qubits, including curves for computations only and computation plus virtual memory operations.

FIG. 7 shows the time for one iteration as a function of the number of qubits.

FIG. 8 shows comparisons of the memory needed for the Shor and Grover algorithms.

FIG. 9 shows the time required for a fixed number of iterations versus the number of qubits and versus the processor clock frequency.

FIG. 10 shows the time required for 10 iterations with different clock frequencies.

FIG. 11 shows the time required for one iteration as a function of the number of qubits.

FIG. 12 shows the time versus number of iterations and versus the number of qbits for the Shor and Grover algorithms.

FIG. 13 shows curves from FIG. 12 for 10 iterations.

FIG. 14 shows the spatial complexity of a quantum algorithm.

FIG. 15 shows the difference between two quantum algorithms due to demands on the processor front side bus.

FIG. 16 shows computational runtime differences between the Shor, Grover, and Deutch-Josza algorithms.

FIG. 17 a shows a generalized representation of a QA as a set of sequentially-applied smaller quantum gates.

FIG. 17 b shows an alternate representation of a QA.

FIG. 18 a shows a quantum state vector set up to an initial value.

FIG. 18 b shows the quantum state vector of FIG. 18 a after the superposition operator is applied.

FIG. 18 c shows the quantum state vector of FIG. 18 b after the entanglement operation in Grover's algorithm

FIG. 18 d shows the quantum state vector of FIG. 18 c after application of the interference operation.

FIG. 19 a shows the dynamics of Grover's QSA probabilities of the input state vector.

FIG. 19 b shows the dynamics of Grover's QSA probabilities of the state vector after superposition and entanglement.

FIG. 19 c shows the dynamics of Grover's QSA probabilities of the state vector after interference.

FIG. 20 shows the Shannon information entropy calculation for the Grover's algorithm with 5 inputs.

FIG. 21 shows spatial complexity of a Grover QA simulation.

FIG. 22 shows temporal complexity of Grover's QSA.

FIG. 23 shows Shannon entropy simulation of a QSA with 7-inputs.

FIG. 24 a shows the superposition operator representation algorithm for Grover's QSA.

FIG. 24 b shows an entanglement operator representation algorithm for Grover's QSA.

FIG. 24 c shows an interference operator representation algorithm for Grover's QSA.

FIG. 24 d shows an interference operator representation algorithm for Deutsch-Jozsa's QA.

FIG. 24 e shows an entanglement operator representation algorithm for Simon's and Shor's QA.

FIG. 24 f shows the superposition and interference operator representation algorithm for Simon's QA.

FIG. 24 g shows an interference operator representation algorithm for Shor's QA.

FIG. 25 shows state vector representation algorithm for Grover's quantum search.

FIG. 26 shows a generalized schema of simulation for Grover's QSA.

FIG. 27 shows the superposition block for Grover's QSA.

FIG. 28 a shows emulation of the entanglement operator application of Grover's QSA.

FIG. 28 b shows emulation of interference operator application of Grover's QSA.

FIG. 28 c shows the quantum step block for Grover's quantum search.

FIG. 29 shows the termination block for method 1.

FIG. 30 shows component B for the termination block.

FIG. 31 a shows component PUSH for the termination block.

FIG. 31 b shows component POP for the termination block.

FIG. 32 shows component C for the termination block.

FIG. 33 shows component D for the termination block.

FIG. 34 shows component E for the termination block.

FIG. 35 shows final measurement emulation.

FIG. 36 shows a generalized schema of simulation for Deutsch-Jozsa's QA.

FIG. 37 shows a quantum block HUD for Deutsch-Jozsa's QA.

FIG. 38 shows a generalized approach for QA simulation.

FIG. 39 shows query processing.

FIG. 40 shows a general structure of Quantum Soft Computing tools.

FIG. 41 a is a block diagram of an intelligent nonlinear control system.

FIG. 41 b shows a superposition of coefficient gains.

FIG. 42 shows the structure of the design process.

FIG. 43 shows robust KB design with a quantum algorithm.

FIG. 44 a shows coefficient gains of a Q-PD controller.

FIG. 44 b shows coefficient gains scheduled by a FC trained using Gaussian excitation.

FIG. 44 c shows coefficient gains scheduled by a FC trained using non-Gaussian excitation.

FIG. 44 d shows control object dynamics.

FIG. 45 shows simulation result of the FIG. 44 b, under non-gaussian excitation.

FIG. 46 shows the addition of a new Hadamard operator, as example, between the oracle (entanglement) and the diffusion operators in Grover's QSA.

FIG. 47 shows the steps of QSA2.

FIG. 48 shows one embodiment if a circuit implementation using elementary gates. The probability of finding a solution varies according to the number of matches M≠0 in the superposition.

FIG. 49 shows the probability of success of the QSA1 and QSA2 algorithms after one iteration.

FIG. 50 shows the iterating version of the algorithm QSA1.

FIG. 51 shows the iterating version of the QSA2 algorithm.

FIG. 52 shows the probability of success of the iterative version of the QSA1 algorithm.

FIG. 53 shows the probability of success of the iterative version of the algorithm QSA1 after five iterations.

FIG. 54 shows the probability of success of the iterative version of the QSA2 algorithm.

FIG. 55 shows the probability of success of the iterative version of the QSA2 algorithm after five iterations.

FIG. 56 a shows results from different approaches for simulation of Grover's QSA.

FIG. 56 b shows results from different approaches for simulation of Deutsch-Jozsa's QA.

FIG. 56 c shows results from different approaches for simulation of Simon's and Shor's quantum algorithms.

FIG. 57 a shows the optimal number of iterations for different qubit numbers and corresponding Shannon entropy behavior of Grover's QSA simulation.

FIG. 57 b shows results of Shannon entropy behavior for different qubit numbers (1-8) in Deutsch-Jozsa's QA.

FIG. 57 c shows results of Shannon entropy behavior for different qubit numbers (1-8) in Simon's QA.

FIG. 57 d shows results of Shannon entropy behavior for different qubit numbers (1-8) in Shor's QA.

FIG. 58 shows the optimal number of iterations for different database sizes.

FIG. 59 shows simulation results of problem oriented Grover QSA according to approach 4 with 1000 qubits.

FIG. 60 summarizes different approaches for QA simulation.

DETAILED DESCRIPTION

The simplest technique for simulating a Quantum Algorithm (QA) is based on the direct representation of the quantum operators. This approach is stable and precise, but it requires allocation of operator's matrices in the computer's memory. Since the size of the operators grows exponentially, this approach is useful for simulation of QAs with a relatively small number of qubits (e.g., approximately 11 qubits on a typical desktop computer). Using this approach it is relatively simple to simulate the operation of a QA and to perform fidelity analysis.

In one embodiment, a more efficient fast quantum algorithm simulation technique is based on computing all or part of the operator matrices on an as-needed basis. Using this technique, it is possible to avoid storing all or part of the operator matrices. In this case, the number of qubits that can be simulated (e.g., the number of input qubits, or the number of qubits in the system state register) is affected by: (i) the exponential growth in the number of operations required to calculate the result of the matrix products; and (ii) the size of the state vector that is allocated in computer memory. In one embodiment, using this approach it is reasonable to simulate up to 19 or more qubits on typical desktop computer, and even more on a system with vector architecture.

Due to particularities of the memory addressing and access processes in a typical desktop computer (such as, for example, a Pentium-based Personal Computer), when the number of qubits is relatively small, the compute-on-demand approach tends to be faster than the direct storage approach. The compute-on-demand approach benefits from a study of the quantum operators, and their structure so that the matrix elements can be computed more efficiently.

The study portion of the compute-on-demand approach can, for some QAs lead to a problem-oriented approach based on the QA structure and state vector behavior. For example, in Grover's Quantum Search Algorithm (QSA), the state vector always has one of the two different values: (i) one value corresponds to the probability amplitude of the answer; and (ii) the second value corresponds to the probability amplitude of the rest of the state vector. Using this assumption, it is possible to configure the algorithm using these two different values, and to efficiently simulate Grover's QSA. In this case, the primary limit is a representation of the floating-point numbers used to simulate the actual values of the probability amplitudes. After the superposition operation, these probability amplitudes are very small $\left( \frac{1}{2^{n/2}} \right).$ Thus, it is possible to simulate Grover's QSA with this approach simulating 1024 qubits or more without termination condition calculation and up to 64 qubits or more with termination condition estimation based on Shannon entropy.

Other QAs do not necessarily reduce to just two values. For those algorithms that reduce to a finite number of values, the techniques used to simplify the Gover QSA can be used, but the maximum number of input qubits that can be simulated will tend to be smaller, because the probability amplitudes of other algorithms have relatively more complicated distributions. Introduction of an external excitation can decrease the possible number of qubits for some algorithms.

In some algorithms, the entanglement and interference operators can be bypassed (or simplified), and the output computed based only on a superposition of the initial states (and deconstructive interference of the final output patterns) representing the state of the designed schedule of control gains. For example, a particular case of Deutsch-Jozsa's and Simon algorithms can be made entanglement free by using pseudo-pure quantum states.

The disclosure that follows begins with a comparative analysis of the temporal complexity of several representative QAs. That analysis is followed by an introduction of the generalized approach in QA simulation and algorithmic representation of quantum operators. Subsequent portions describe the structure representation of the QAs applicable to low level programming on classical computer (PC), generalizations of the approaches and introduction of the general QA simulation tool based on fast problem-oriented QAs. The simulation techniques are then applied to a quantum control algorithm.

1. Spatio-Temporal Complexity of QA Simulation Based on the Full Matrix Approach

I. Spatio-Temporal Complexity of Grover's Quantum Algorithm

1.1. Introduction

Practical realization of quantum search algorithms on classical computers is limited by the available hardware resources. Well-known algorithmic estimations for the number database transactions required by the Grover search algorithm cannot be considered directly on von Neumann computers. Classical versions of QAs depend on the effectiveness and efficiency of the mathematical models used to simulate the quantum-mechanical operations.

Thus, it is useful to analyze quantum algorithms to determine, or at least estimate, time expenses, influence of processor clock frequency, memory requirements, and Shannon entropy behavior of the QA. Evaluating time expenses of the Grover QSA includes evaluating the number of oracle queries (temporal complexity) for a fixed number of iterations of the Grover's QSA as a function of the number of qubits. Evaluating the effect of the central processor clock time includes estimating the influence of the central processor frequency on the time required for making a fixed number of iterations. Runtime does not necessarily scale linearly with processor clock speed due to effects of memory access, cache access, processor wait states, processor pipelines, processor branch estimation, etc. The required physical memory size (spatial complexity) depends on the algorithm and the number of qubits. The Shannon entropy behavior provides insight into the number of iterations required to arrive at a solution, and thus provides insight into the temporal complexity of the QA. The understanding gained from examining the spatio-temproral complexity helps in understanding the computing resources needed to simulate a desired QA with a desired number of qubits.

1.2. Computational Examples

FIG. 1 shows the memory requirements versus number of qubits for a MATLAB 6.0 simulation environment used for modeling a QSA. FIG. 1 shows that 128 MB of memory allows simulation of up to 8 qubits (corresponding to 2⁸ elements in the database). FIG. 2 shows the time required to simulate Grover's QSA versus the number of qubits and versus the number of iterations on a Pentium III computer with 128 MB of main memory and processor clock frequencies of 600, 800, and 1000 MHz. FIG. 3 shows the influence of processor internal frequency on the time required for making 100 iterations (from FIG. 2). As shown in FIG. 3, the runtime does not scale linearly with processor speed.

A linear increase of the number of qubits results in an exponential increase in the amount of memory required. In one embodiment, a computer with 512 MB of memory running MATLAB 6.0 is able to simulate 10 qubits before memory limitations begin to dominate. FIGS. 4 and 5 show runtime versus number of iterations and versus number of qubits (from 8 to 10) for the 512 MB hardware configuration.

Once the computer physical memory is full, a further increase in the number of qubits causes virtual memory paging and performance degrades rapidly, as shown in FIG. 6. FIG. 6 shows time required for making one iteration of Grover's QSA for 11 qubits on a computer with 512 MB of physical memory—with and without virtual memory operations. As shown in the figure, the time required to perform virtual memory operations accounts for 50-70% of the time required to do calculations only.

FIG. 7 shows the exponentially increasing time required for making one iteration versus the number of qubits (from 1 to 11) on a computer with 512 MB physical memory and an Intel Pentium III processor running at 800 MHz. Since the time required for making one iteration grows exponentially as the number of qubits increases, it is useful to determine the minimum number of iterations that guarantees a high probability of obtaining a correct answer.

The Shannon entropy can be considered as a criteria for solution of the QA-termination problem. Table 1.1 shows tabulated results of the number of qubits, Shannon entropy, and the number of iterations required. TABLE 1.1 Number of Shannon Number of qubit entropy iterations 1 2.0 1 2 1.0 2 3 1.00351 7 4 1.0965 10 4 1.00721 16 5 1.01362 5 6 1.05330 7 6 1.02879 32 7 1.07123 9 7 1.00021 27 8 1.00002 13 9 1.00024 18 10 1.00024 26

The timing results presented above are provided by way of explanation and for trend analysis, and not by way of limitation. Different programming systems would likely yield different absolute values for the measured quantities, but the trends would nevertheless remain. Thus, several observations can be drawn from the data shown in FIGS. 1-7. According to contemporary standards of personal computer hardware, QSAs can be adopted for relatively small databases (up to 2¹¹-2¹² elements). For a system with more than 2 qubits, the correct result calculation correlates with achieving a minimum value of Shannon entropy. Thus, the minimum number of iterations needed to achieve a desired accuracy can be estimated from the number of qubits.

II. Temporal complexity of Grover's quantum algorithm in comparison with Shor's QA

2.1. Introduction

The results in FIGS. 1-7 were obtained by simulating Grover's QSA. FIG. 8 shows a comparison of the memory used by Shor's algorithm as compared to Grover's algorithm for 1 to 5 qubits. As shown in FIG. 8, Shor's algorithm requires considerably more memory. The qualitative properties of functions analyzed by Grover algorithm take Boolean values “true” and “false.” By contrast, Shor's algorithm analyzes functions that can take various values as input parameters. This fact inevitably leads to a considerable increase in the amount of memory required for a given number of qubits. For Shor's algorithm, directly simulating a system with 5 qubits is practical, but a simulation with 6 qubits becomes impractical because the memory requirements are increasing exponentially. FIG. 9 shows the time required to run Shor's algorithm and Grover's algorithm versus the number of qubits and the number of iterations. FIG. 10 corresponds to FIG. 9 where the number of iterations is fixed at 10. FIG. 11 shows an exponential increase in the time required for making one iteration as the number of qubits increases from 1 to 5. FIG. 12 and FIG. 13 shows comparisons of computer hardware requirements of Shor's and Grover's quantum algorithms concerning time of execution.

The comparative analysis of Shor's and Grover's quantum algorithms afforded by FIGS. 8-12 shows that maximum number of qubits that can be simulated in Shor's algorithm is relatively smaller than in Grover's algorithm (for direct simulation). Since realization of Shor's algorithm on classical computers is more demanding to hardware resources than realization of Grover's algorithm, appropriate hardware acceleration for practically significant applications is relatively more important for Shor's algorithm than for Grover's algorithm.

III. Comparative Temporal Complexity of Grover's QA, Shor's QA and Deutsch-Jozsa's QA

FIG. 14 shows the runtime needed for 10 iterations of the Shor and Grover algorithms on a representative computer versus the number of qubits. The exponential increase shown by Shor's algorithm is much faster than the time increase shown by Grover's algorithm. FIG. 15 shows how the frequency of the processor front side bus (FSB) on a Pentium III processor affects the time needed to make one iteration of a QA.

FIG. 16 shows the runtime differences between the Shor, Grover, and Deutsch-Josza quantum algorithms as a function of the number of qubits. As shown in FIG. 16, Shor's algorithm runs considerably slower than either the Grover or the Deutsch-Josza algorithms. This result arises from the structure of Shor's algorithm. In Shor's quantum algorithm, the number of qubits used for measurement is equal to the number of input qubits. This means that running a Shor's algorithm simulation for 5 qubits is the same as running a Grover's algorithm simulation with 9 qubits. Moreover, Shor's algorithm requires twice as much memory in order to store with complex numbers. As shown in FIG. 16, for the tested hardware and software realization of Deutsch-Jozsa algorithm, simulation of systems with more than 11 qubits becomes increasingly impractical.

IV. Information Analysis of Quantum Complexity of QAs: Quantum Query Tree Complexity

The existing QAs described above can be naturally expressed using a black-box model. It is then useful to consider the spatio-temporal complexity of QAs from the quantum query complexity viewpoint. For example, in the case of Simon's problem, one is given a function ƒ:

0,1

^(n)→

0,1

^(n) and a promise that there is an s ε

0,1

^(n) such that ƒ(i)=ƒ(j)iff i=j⊕s. The goal is to determine whether s=0 or not. Simon's QA yields an exponential speed-up over a classical algorithm. Simon's QA requires an expected number of O (n) applications of ƒ, whereas, every classical randomized algorithm for the same problem must make Ω(√{overscore (2^(n))}) queries.

The function ƒ can be viewed as a black-box X=(x₀, . . . , x_(N−1)) of N=2^(n) bits, and that an ƒ-application can be simulated by n queries to X. Thus, Simon's problem fits squarely in the black-box setting, and exhibits an exponential quantum-classical separation for this promise-problem. The promise means that Simon's problem ƒ:

0,1

^(n)→

0,1

^(n) is partial; i.e., it is not defined on all X ε

0,1

^(n) but only on X that correspond to an X satisfying the promise.

Table 1.2 list the quantum complexity of various boolean functions such as OR, AND, PARITY, and MAJORITY TABLE 1.2 Some quantum complexities Function Exact Zero-error Bounde-error OR_(N), AND_(N) N N $\Theta\left( \sqrt{N} \right)$ PARITY_(N) $\frac{N}{2}$ $\frac{N}{2}$ $\frac{N}{2}$ MAJORITY_(N) Θ(N) Θ(N) Θ(N)

For example, consider the property OR_(N)(X)=x₀ ν . . . νx_(N−1). The number of queries required to compute OR_(N)(X) by any classical (deterministic or randomized) algorithm is Θ(N). The lower bound for OR implies a lower bound for the search problem where it is desired to find an i, such that x_(i)=1, if such an i exists. Thus, an exact or zero-error QSA requires N queries, in contrast to Θ(√{overscore (N)}) queries for the bounded-error case. On the other hand, the number of solutions is r and a solution can be found with probability 1 using $O\left( \sqrt{\frac{N}{k}} \right)$ queries. Grover discovered a QSA that can be used to compute OR_(N) with small error probability using only O(√{overscore (N)}) queries. In this case of OR_(N), the function is total; however, the quantum speed-up is only quadratic instead of exponential.

A similar result holds for the order-finding problem, which is the core of Shor's efficient quantum factoring algorithm. In this case, the promise is the periodicity of a certain function derived from the number to be factored.

A boolean function is a function ƒ:

0,1

^(n)→

0,1

. Note that ƒ is total, i.e., it is defined on all n-bit inputs. For an input x ε

0,1

^(n), x_(i) to denotes its i th bit, so x=

x₁ . . . x_(n)

. The expression |x| is used to denote the Hamming weight of x (its number of 1's). A more general form of a Boolean function can be defined as ƒ:

0,1

^(n) ⊃A→B=ƒ(A)⊂

0,1

^(m), for some integers n, m>0. If S is a set of (indices of) variables, then x^(s) denotes the input obtained by flipping the S-variables in x. The function ƒ is symmetric if ƒ(x) only depends on |x|. Some common symmetric functions are: $\begin{matrix} {{{{OR}_{n}(x)} = {{1\quad{iff}{x}} \geq 1}};} & (i) \\ {{{{AND}_{n}(x)} = {{1\quad{iff}{x}} = n}};} & ({ii}) \\ {{{{PARITY}_{n}(x)} = {1\quad{iff}{x}{is}\quad{odd}}};} & ({iii}) \\ {{{MAJ}_{n}(x)} = {{1\quad{iff}{x}} > {\frac{n}{2}.}}} & ({iv}) \end{matrix}$

The quantum oracle model is used to formalize a query to an input x ε

0,1

^(n) as a unitary transformation O that maps |i, b, z> to |i, b⊕x_(i), z> is most some m-qubit basis state, where i takes ┌log n┐ bits, b is one bit. The value z denotes the (m−┌log n┐−1)-bit “workspace” of the quantum computer, which is not affected by the query. Applying the operator O_(ƒ) twice is equivalent to applying the identity operator, and thus O_(ƒ) is unitary (and reversible) as required. The mapping changes the content of the second register (|b>) conditioned on the value of the first register |i>.

The queries are implemented using unitary transformations O_(j) in the following standard way. The transformation O_(j) only affects the leftmost part of a basis state: it maps basis state |i, b, z> to |i, b⊕x_(i), z>. Note that the O_(j) are all equal. This generalizes the classical setting where a query inputs an i into a black-box, which returns the bit x_(i). Applying O to the basis state |i,0,z> yields |i,x_(i),z>, from which the i th bit of the input can be read. Because O has to be unitary, it is specified to map |i,1,z> to |i,1−x_(i),z>. Note that a quantum computer can make queries in superposition: applying O once to the state ${\frac{1}{\sqrt{n}}{\sum\limits_{i = 1}^{n}{\left. {i,0,z} \right\rangle\quad{gives}\frac{1}{\sqrt{n}}{\sum\limits_{i = 1}^{n}\left. {i,x_{i},z} \right\rangle}}}},$ which in some sense contains all bits of the input.

A quantum decision tree has the following form: start with an m-qubit state |{right arrow over (0)}> where every bit is 0. Since it is desired to compute a function of X, which is given as a black-box, the initial state of the network is not very important and can be disregarded. Thus, the initial state is assumed to be |{right arrow over (0)}> always. Next, apply a unitary transformation U₀ to the state, then apply a query O, then another transformation U₁, etc. A T-query quantum decision tree thus, corresponds to a unitary transformation A=U_(T)OU_(T−1) . . . OU₁OU₀. Here the U_(i) are fixed unitary transformations, independent of the input x. The final state A|{right arrow over (0)}> depends on the input x only via the T applications of O. The output obtained by measuring the final state and outputting the rightmost bit of the observed basis state. Without loss of generality, it can be assumed that there are no intermediate measurements.

A quantum decision tree is said to compute ƒ exactly if the output equals ƒ(x) with probability 1, for all x ε

0,1

^(n). The tree computes ƒ with bounded-error if the output equals ƒ(x) with probability at least $\frac{2}{3},$ for all x ε

0,1

^(n).

The function Q_(E) (ƒ) denotes the number of queries of an optimal quantum decision tree that computes ƒ exactly, Q₂ (ƒ) is the number of queries of an optimal quantum decision tree that computes ƒ with bounded-error. Note that the number of queries is counted, not the complexity of the U_(i).

Unlike the classical deterministic or randomized decision trees, the QAs are not necessarily trees anymore (the names “quantum query algorithm” or “quantum black-box algorithm” can also be used). Nevertheless, the term “quantum decision tree” is useful, because such QAs generalize classical trees in the sense that they can simulate them as described below.

Consider a T-query deterministic decision tree. It first determines which variable it will query first; then it determines the next query depending upon its history, and so on for T queries. Eventually, it outputs an output-bit depending on its total history. The basis states of the corresponding QA have the form |i, b, h, a>, where i, b is the query-part, h ranges over all possible histories of the classical computation (this history includes all previous queries and their answers), and a is the rightmost qubit, which will eventually contain the output. Let U,map the initial state |{right arrow over (0)},0,{right arrow over (0)},0> to |i,0,{right arrow over (0)},0>, and x_(i) is the first variable that classical tree would query. Now, the QA applies O, which turns the state into |i, x_(i),{right arrow over (0)},0>. Then the algorithm applies a transformation U₁ that maps |i, x_(i),{right arrow over (0)},0> to |j,0,h,0), where h is the new history (which includes i and x_(i)) and x_(j) is the variable that the classical tree would query given the outcome of the previous query. Then when the quantum tree applies O for the second time, it applies a transformation U₂ that updates the workspace and determines the next query, etc. Finally, after T queries, the quantum tree sets the answer bit to 0 or 1 depending on its total history. All operations U_(i) performed here are injective mappings from basis states to basis states, hence they be extended to permutations of basis states, which are unitary transformations. Thus a T-query deterministic decision tree can be simulated by an exact a T-query quantum decision tree with the same error probability (basically because a superposition can “simulate” a probability distribution). Accordingly, Q₂(ƒ)≦R₂(ƒ)≦D(ƒ)≦n and Q₂(ƒ)≦Q_(E)(ƒ)≦D(ƒ)≦n for all ƒ.

If ƒ is non-constant and symmetric, then D(ƒ)=(1−o(1))n;   (i) R ₂(ƒ)=Θ(n);   (ii) Q _(E)(ƒ)=Θ(n);   (iii) Q ₂(ƒ)=Θ(√{overscore (n(n−Γ(ƒ)))}),   (iv) where Γ(ƒ)=min

|2k−n+1|:ƒ_(k)≠ƒ_(k+1)

is quantity measure of length of the interval around hamming weight $\frac{n}{2}$ where ƒ_(k) is constant. The function ƒ flips value if the hamming weight of the input changes from k to k+1 (this Γ(ƒ) is a number that is low if ƒ flips for inputs with hamming weight close to $\left. \frac{n}{2} \right).$ This can be compared with the classical bounded-error query complexity of such functions, which is Θ(n). Thus, Γ(ƒ) characterizes the speed-up that QAs give for all total functions.

Unlike classical decision trees, a quantum decision tree algorithm can make queries in a quantum superposition, and therefore, may be intrinsically faster than any classical algorithm. The quantum decision tree model can also be referred to as the quantum black-box model.

Let Q(ƒ) be the quantum decision tree complexity of ƒ with error-bounded probability by $\frac{1}{3}.$ It is possible to derive a general lower bound for Q(ƒ) in terms of Shannon entropy S^(Sh) (ƒ) defined as follows. For any ƒ, define the entropy of ƒ, S^(Sh)(ƒ), to be the Shannon entropy of ƒ(X), where X is taken uniformly random from A: ${{S^{Sh}(f)} = {- {\sum\limits_{y \in B}{p_{y}\log_{2}p_{y}}}}},$ where p_(y)=Pr_(xε) _(R) _(A)[ƒ(x)=y]. For any ƒ, $\begin{matrix} {{Q(f)} = {{\Omega\left( \frac{S^{Sh}(f)}{\log\quad n} \right)}.}} & \left( {1\text{.}1} \right) \end{matrix}$

In this case, the computation process can be viewed as a process of communication. To make a query, the algorithm sends the oracle ┌log n┐ bits, which are then returned by the oracle. The first ┌log n┐ bits specify the location of the input bit being queried and the remaining one bit allows the oracle to write down the answer. The QA runs on ${\frac{1}{\sqrt{A}}{\sum\limits_{x \in A}{\left. x \right\rangle_{X}\left. y \right\rangle_{Y}}}},$ where X(Y) denotes the qubits that hold the input (intermediate results of computing), respectively. It is useful to now consider the von Neumann entropy, S^(vN(t))(ƒ), of the density matrix ρ_(Y) after t th query. If the QA computes ƒ in T queries, at the end of computation, one expect to have a vector close to $\left. \left. {\frac{1}{\sqrt{A}}\sum\limits_{x \in A}} \middle| x \right\rangle_{X} \middle| {f(x)} \right\rangle_{Y}.$ For the initial (pure) state, S^(vN(0))(ƒ)=0. By using Holevo's theorem, one can show that S^(vN(T))(ƒ)≈S^(Sh)(ƒ). Furthermore, by the sub-additivity of the von Neumann entropy |S ^(vN(t+1))(ƒ)−S ^(vN(t))(ƒ)|=O(log n) for any t with 0≦t≦T−1 .

Therefore, $T = {{\Omega\left( \frac{S^{Sh}(f)}{\log\quad n} \right)}.}$ This bound is tight.

This means one quantum query can get log n bits of information, while any classical query get no more than 1 bit of information. This power of getting ω(1) bits of information from a query is not useful in computing total functions, which are functions that are defined on every string in

0,1

^(n), in the sense that each quantum query can only yield O(1) bits of information on average.

For this more general case, for any total function ƒ, Q(ƒ)=Ω(S ^(Sh)(ƒ)).   (1.2)

Thus, the minimum of Shannon entropy in the final solution output of the QA means its has minimal quantum query complexity. The interrelations in Eqs (1.1) and (1.2) between quantum query complexity and Shannon entropy are used in the solution of QA-termination problem (see below in Section 3). As mentioned above, the number of queries is counted, not the complexity of the U_(i). The complexity of a quantum operator U_(i) and its interrelations with the temporal complexity of a QA is considered below.

The matrix-based approach can be efficiently realized for a small number of input qubits. The matrix approach is used above as a useful tool to illustrate complexity issues associated with QA simulation on classical computer.

2. Algorithmic Representation of the Quantum Operators and Quantum Algorithms

2.1. Structure of QA Gate System Design

As shown in FIG. 17 a, a QA simulation can be represented as a generalized representation of a QA as a set of sequentially-applied smaller quantum gates. From the structural point of view, each QA is based on a particular set of quantum gates, but generally speaking, each particular set can be divided into superposition operators, entanglement operators, and interference operators.

This division into superposition operators, entanglement operators, and interference operators permits a generalization of the design of a simulation and allows creation of a classical tool to simulate QAs. Moreover, local optimization of QA components according to specific hardware realization makes it possible to develop appropriate hardware accelerators for QA simulation using classical gates.

2.2. Generalized Approach in QA Simulation

In general, any QA can be represented as a circuit of smaller quantum gates as shown in FIGS. 17 a-b. The circuit shown in the FIG. 17 a is divided into five general layers: input, superposition, entanglement, interference, output.

Layer 1: Input. The quantum state vector is set up to an initial value for this concrete algorithm. For example, input for Grover's QSA is a quantum state |φ₀> described as a tensor product $\begin{matrix} \begin{matrix} {\left. {\left. {\left. {\left. {\left. {\left. {\left. \left| \phi_{0} \right. \right\rangle = \left. a_{1} \middle| 0 \right.} \right\rangle \otimes \ldots \otimes} \middle| 0 \right\rangle \otimes} \middle| 0 \right\rangle + a_{2}} \middle| 0 \right\rangle \otimes \ldots \otimes} \middle| 0 \right\rangle \otimes} \middle| 1 \right\rangle +} \\ \left. {\left. {\left. {\left. {\left. {\left. \left. a_{3} \middle| 0 \right. \right\rangle \otimes \ldots \otimes} \middle| 1 \right\rangle \otimes} \middle| 0 \right\rangle + \ldots + a_{n}} \middle| 1 \right\rangle \otimes \ldots \otimes} \middle| 1 \right\rangle \otimes} \middle| 1 \right\rangle \\ \left. {\left. {\left. {= \left. 1 \middle| 0 \right.} \right\rangle \otimes \ldots \otimes} \middle| 0 \right\rangle \otimes} \middle| 1 \right\rangle \\ {\left. {= \left| {0{\ldots 01}} \right.} \right\rangle,} \end{matrix} & (2.1) \\ {{\left. {{\left. {where}\quad \middle| 0 \right\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}};} \middle| 1 \right\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}};} & \quad \end{matrix}$ {circle around (×)} denotes Kronecker tensor product operation. Such a quantum state can be presented as shown on the FIG. 18 a.

The coefficients a_(i) in the Eq. (2.1) are called probability amplitudes. Probability amplitudes can take negative and/or complex values. However, the probability amplitudes must obey the following constraint: $\begin{matrix} {{\sum\limits_{i}a_{i}^{2}} = 1} & (2.2) \end{matrix}$

The actual probability of the arbitrary quantum state a_(i) |i> to be measured is calculated as a square of its probability amplitude value p_(i)=|a_(i)|².

Layer 2: Superposition. The state of the quantum state vector is transformed by the Walsh-Hadamard operator so that probabilities are distributed uniformly among all basis states. The result of the superposition layer of Grover's QSA is shown in FIG. 18 b as a probability amplitude representation, and also in FIG. 19 b as a probability representation.

Layer 3: Entanglement. Probability amplitudes of the basis vector corresponding to the current problem are flipped while rest basis vectors left unchanged. Entanglement is typically provided by controlled-NOT (CNOT) operations. FIGS. 18 c and 19 c show results of entanglement from the application of the operator to the state vector after superposition operation. An entanglement operation does not affect the probability of the state vector to be measured. Rather, entanglement prepares a state, which cannot be represented as a tensor product of simpler state vectors. For example, consider state φ₁ shown in the FIG. 18 b and state φ₂ presented on the FIG. 18 c: $\begin{matrix} {\left. {\left. {\left. {\left. {\left. {\left. {\phi_{1} = {0.35355\left( \left| 000 \right. \right.}} \right\rangle -} \middle| 001 \right\rangle +} \middle| 010 \right\rangle -} \middle| 011 \right\rangle +} \middle| 100 \right\rangle -} \middle| 101 \right\rangle +} \\ \left. \left. {\left. \left| 110 \right. \right\rangle -} \middle| 111 \right\rangle \right) \\ \left. \left. {{\left. \left. \left. {\left. {= \left. {{0.35355\left( \left| 00 \right. \right\rangle} +} \middle| 01 \right.} \right\rangle +} \middle| 10 \right\rangle \middle| 11 \right\rangle \right)\left( \left| 0 \right. \right\rangle} -} \middle| 1 \right\rangle \right) \end{matrix}$ $\begin{matrix} {\left. {\left. {\left. {\left. {\left. {\phi_{2} = \left. {{0.35355\left( \left| 000 \right. \right\rangle} -} \middle| 001 \right.} \right\rangle -} \middle| 010 \right\rangle +} \middle| 011 \right\rangle +} \middle| 100 \right\rangle -} \middle| 101 \right\rangle +} \\ \left. \left. {\left. \left| 110 \right. \right\rangle -} \middle| 111 \right\rangle \right) \\ {\left. \left. \left. {\left. {\left. {= \left. {{0.35355\left( \left| 00 \right. \right\rangle} -} \middle| 01 \right.} \right\rangle +} \middle| 10 \right\rangle +} \middle| 11 \right\rangle \right) \middle| 0 \right\rangle - {0.35355\left( \left| 00 \right. \right\rangle} +} \\ \left. \left. \left. {\left. {\left. \left| 01 \right. \right\rangle +} \middle| 10 \right\rangle +} \middle| 11 \right\rangle \right) \middle| 1 \right\rangle \end{matrix}$

As shown above, the description of state φ₁ can be presented as a tensor product of simpler states, while state φ₂ (in the measurement basis

|0>,|1

) cannot.

Layer 4: Interference. Probability amplitudes are inverted about the average value. As a result, the probability amplitude of states “marked” by entanglement operation will increase. FIGS. 18 d and 19 d show the results of interference operator application. FIG. 18 d shows probability amplitudes and FIG. 19 d shows probabilities.

Layer 5: Output. The output layer provides the measurement operation (extraction of the state with maximum probability), followed by interpretation of the result. For example, in the case of Grover's QSA, the required index is coded in the first n bits of the measured basis vector.

Since the various layer of the QA are realized by unitary quantum operators, simulation of quantum operators depend on simulation of such unitary operators. Thus, in order to develop an efficient, simulation, it is useful to understand the nature of the QAs basic quantum operators.

2.3. Basic QA Operators

The superposition, entanglement and interference operators are now considered from the simulation viewpoint. In this case, the superposition operators and the interference operators have more complicated structure and differ from algorithm to algorithm. Thus, it is first useful to consider the entanglement operators, since they have a similar structure for all QAs, and differ only by the function being analyzed.

In general, the superposition operator is based on the combination of the tensor products Hadamard H operators $H = {\frac{1}{\sqrt{2}}\begin{bmatrix} 1 & 1 \\ 1 & {- 1} \end{bmatrix}}$ with identity operator I: $I = {\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}.}$

For most QAs the superposition operator can be expressed as $\begin{matrix} {{{Sp} = {\left( {\underset{i = 1}{\overset{n}{\otimes}}H} \right) \otimes \left( {\underset{i = 1}{\overset{m}{\otimes}}S} \right)}},} & (2.3) \end{matrix}$

where n and m are the numbers of inputs and of outputs respectively. The operator S depends on the algorithm and can be either the Hadamard operator H or the identity operator I. The numbers of outputs m as well as structures of the corresponding superposition and interference operators are presented in Table 2.1 for different QAs. TABLE 2.1 Parameters of superposition and interference operators of main quantum algorithms Algorithm Superposition m Interference Deutsch's H

I 1 H

H Deutsch- ^(n)H

H 1 ^(n)H

I Jozsa's Grover's ^(n)H

H 1 D_(n)

I Simon's ^(n)H

^(n)I n ^(n)H

^(n)I Shor's ^(n)H

^(n)I n QFT_(n)

^(n)I

Superposition and interference operators are often constructed as tensor powers of the Hadamard operator, which is called the Walsh-Hadamard operator. Elements of the Walsh-Hadamard operator can be obtained as $\begin{matrix} {{\left\lbrack {\,^{n}H} \right\rbrack_{i,j} = {{\frac{\left( {- 1} \right)^{i*j}}{2^{n/2}}\left\lbrack {\,^{n - 1}H} \right\rbrack} = {\frac{1}{2^{n/2}}\begin{pmatrix} {\,^{({n - 1})}H} & {\,^{({n - 1})}H} \\ {\,^{({n - 1})}H} & {- {\,^{({n - 1})}H}} \end{pmatrix}}}},} & (2.4) \end{matrix}$ where i=0,1, j=0,1, H denotes Hadamard matrix of ordder 2.

The rule in Eq. (2.4) provides way to speed up of the classical simulation of the Walsh-Hadamard operators, because the elements of the operator can be obtained by the simple replication described in Eq. (2.4) from the elements of the ^(n−1)H order operator. For example, consider the superposition operator of Deutsch's algorithm, n=1, m=1, S=I: $\begin{matrix} \begin{matrix} {\lbrack{Sp}\rbrack_{i,j}^{Deutsch} = {\frac{\left( {- 1} \right)^{i*j}}{2^{1/2}} \otimes I}} \\ {= {\frac{1}{\sqrt{2}}\begin{pmatrix} {\left( {- 1} \right)^{0*0}I} & {\left( {- 1} \right)^{0*1}I} \\ {\left( {- 1} \right)^{1*0}I} & {\left( {- 1} \right)^{1*1}I} \end{pmatrix}}} \\ {= {\frac{1}{\sqrt{2}}\begin{bmatrix} I & I \\ I & {- I} \end{bmatrix}}} \end{matrix} & (2.5) \end{matrix}$

As a further example, consider the superposition operator of Deutsch-Jozsa's and of Grover's algorithm, for the case n=2, m=1, S=H: $\begin{matrix} {\begin{matrix} {\lbrack{Sp}\rbrack^{{{Deutsch}\text{-}{{Jozsa}'}s},{{{Grover}'}s}} = {{\,^{2}H} \otimes H}} \\ {= {\left( \frac{1}{\sqrt{8}} \right){\,^{3}H}}} \\ {= {\frac{1}{\sqrt{8}}\begin{pmatrix} {\,^{2}H} & {\,^{2}H} \\ {\,^{2}H} & {- {\,^{2}H}} \end{pmatrix}}} \\ {{= {\frac{1}{\sqrt{8}}\begin{pmatrix} H & H & H & H \\ H & {- H} & H & {- H} \\ H & H & {- H} & {- H} \\ H & {- H} & {- H} & H \end{pmatrix}}},} \end{matrix}{{{where}\quad H} = \begin{pmatrix} 1 & 1 \\ 1 & {- 1} \end{pmatrix}}} & (2.6) \end{matrix}$

For yet another example, the superposition operator of Simon's and of Shor's algorithms, n=2, m=2, S=I can be expressed as: $\begin{matrix} {\lbrack{Sp}\rbrack_{i,j}^{{Simon},{Shor}} = {{\,^{2}H} \otimes {\,^{2}I}}} \\ {= {\frac{1}{2}{\begin{pmatrix} {\left( {- 1} \right){\,^{0*0}H}} & {\left( {- 1} \right){\,^{1*0}H}} \\ {\left( {- 1} \right){\,^{1*0}H}} & {\left( {- 1} \right){\,^{1*1}H}} \end{pmatrix} \otimes {\,^{2}I}}}} \\ {= {\frac{1}{2}{\begin{pmatrix} H & H \\ H & {- H} \end{pmatrix} \otimes {\,^{2}I}}}} \\ {= {\frac{1}{2}{\begin{pmatrix} 1 & 1 & 1 & 1 \\ 1 & {- 1} & 1 & {- 1} \\ 1 & 1 & {- 1} & {- 1} \\ 1 & {- 1} & {- 1} & 1 \end{pmatrix} \otimes {\,^{2}I}}}} \\ {= {\frac{1}{2}\begin{pmatrix} {\,^{2}I} & {\,^{2}I} & {\,^{2}I} & {\,^{2}I} \\ {\,^{2}I} & {- {\,^{2}I}} & {\,^{2}I} & {- {\,^{2}I}} \\ {\,^{2}I} & {\,^{2}I} & {- {\,^{2}I}} & {\,{-^{2}I}} \\ {\,^{2}I} & {\,{-^{2}I}} & {- {\,^{2}I}} & {\,^{2}I} \end{pmatrix}}} \end{matrix}$

Interference operators are calculated for each algorithm according to the parameters listed in Table 2.1. The interference operator is based on the interference layer of the algorithm, which is different for various algorithms, and from the measurement layer, which is the same or similar for most algorithms and includes the m^(th) tensor power of the identity operator.

The interference operator of Deutsch's algorithm includes the tensor product of two Hadamard transformations, and can be calculated using Eq. (2.4) with n=2 as: $\begin{matrix} \begin{matrix} {\left\lbrack {Int}^{Deutsch} \right\rbrack_{i,j} = {{\,^{2}H} = \frac{\left( {- 1} \right)^{i*j}}{2^{2/2}}}} \\ {= {\frac{1}{2}\begin{pmatrix} {\left( {- 1} \right){\,^{0*0}H}} & {\left( {- 1} \right){\,^{0*1}H}} \\ {\left( {- 1} \right){\,^{1*0}H}} & {\left( {- 1} \right){\,^{1*1}H}} \end{pmatrix}}} \\ {= {\frac{1}{2}\begin{pmatrix} 1 & 1 & 1 & 1 \\ 1 & {- 1} & 1 & {- 1} \\ 1 & 1 & {- 1} & {- 1} \\ 1 & {- 1} & {- 1} & 1 \end{pmatrix}}} \end{matrix} & (2.7) \end{matrix}$

In Deutsch's algorithm, the Walsh-Hadamard transformation in the interference operator is used also for the measurement basis.

The interference operator of Deutsch-Jozsa's algorithm includes the tensor product of the n^(th) power of the Walsh-Hadamard operator with an identity operator. In general form, the block matrix of the interference operator of Deutsch-Jozsa's algorithm can be written as from the n−1 order matrix as: $\begin{matrix} {\begin{matrix} {\left\lbrack {Int}^{{Deutsch}\text{-}{{Jozsa}'}s} \right\rbrack = {{\,^{n}H} \otimes I}} \\ {{= {\frac{1}{2^{n/2}}{\begin{pmatrix} {\,^{({n - 1})}H} & {\,^{({n - 1})}H} \\ {\quad^{({n - 1})}H} & {- {\,^{({n - 1})}H}} \end{pmatrix} \otimes I}}},} \end{matrix}{{{{where}\quad H} = \begin{pmatrix} 1 & 1 \\ 1 & {- 1} \end{pmatrix}},}} & (2.8) \end{matrix}$

Interference operator of Deutsch-Jozsa's algorithm, n=2, m=1: $\begin{matrix} {\left\lbrack {Int}^{{Deutsch}\text{-}{{Jozsa}'}s} \right\rbrack = {{\,^{2}H} \otimes I}} \\ {= {\frac{1}{2}{\begin{pmatrix} H & H \\ H & {- H} \end{pmatrix} \otimes I}}} \\ {= {\frac{1}{2}{\begin{pmatrix} I & I & I & I \\ I & {- I} & I & {- I} \\ I & I & {- I} & {- I} \\ I & {- I} & {- I} & I \end{pmatrix}.}}} \end{matrix}$

The interference operator of Grover's algorithm can be written as a block matrix of the following form: $\begin{matrix} {\begin{matrix} {\left\lbrack {Int}^{Grover} \right\rbrack_{i,j} = {D_{n} \otimes I}} \\ {= {\left( {\frac{1}{2^{n/2}} - {\,^{n}I}} \right) \otimes I}} \\ {{= \left. {\left( {{- 1} + \frac{1}{2^{n/2}}} \right) \otimes I} \right|_{i = j}},} \end{matrix}{\left. {\left( \frac{1}{2^{n/2}} \right) \otimes I} \right|_{i \neq j} = {\frac{1}{2^{n/2}}\left\{ \begin{matrix} {{- I},{i = j}} \\ {I,{i \neq j}} \end{matrix} \right.}}} & (2.9) \end{matrix}$ where i=0, . . . , 2^(n)−1, j=0, . . . , 2^(n)−1, D_(n) refers to diffusion operator $\left\lbrack D_{n} \right\rbrack_{i,j} = {\frac{\left( {- 1} \right)^{1\quad{AND}\quad{({i = j})}}}{2^{n/2}}.}$

For example, the interference operator for Grover's QSA, when n=2, m=1 is: $\begin{matrix} {\begin{matrix} {\left\lbrack {Int}^{Grover} \right\rbrack_{i,j} = {D_{2} \otimes I}} \\ {= {\left( {\frac{1}{2^{2/2}} - {\,^{2}I}} \right) \otimes I}} \\ {{= \left. {\left( {{- 1} + \frac{1}{2}} \right) \otimes I} \right|_{i = j}},} \end{matrix}{\left. {\frac{1}{2} \otimes I} \right|_{i \neq j} = {\frac{1}{2}\begin{pmatrix} {- I} & I & I & I \\ I & {- I} & I & I \\ I & I & {- I} & I \\ I & I & I & {- I} \end{pmatrix}}}} & (2.10) \end{matrix}$

As the number of qubits increases, the gain coefficient will become smaller. The dimension of the matrix increases according to 2^(n), but each element can be extracted using Eq. (2.9), without allocation of the entire operator matrix.

The interference operator of Simon's algorithm is prepared in the same manner as the superposition (as well as superposition operators of Shor's algorithm) and can be described as follows from Eq. (2.3) and Eq. (2.6): ${\left\lbrack {Int}^{Simon} \right\rbrack_{({i,j})} = {{{\,^{n}H} \otimes {\,^{m}I}} = {\frac{\left( {- 1} \right)^{({i*j})}}{2^{n/2}}{{\,^{({n - 1})}H} \otimes {\,^{m}I}}}}},{{{where}\quad H} = \begin{pmatrix} 1 & 1 \\ 1 & {- 1} \end{pmatrix}}$

In general, the interference operator of Simon's algorithm coincides with the interference operator of Deutsch-Jozsa's algorithm Eq. (2.8), but for each block of the operator matrix includes m tensor products of the identity operator.

The Interference operator of Shor's algorithm uses the Quantum Fourier Transformation operator (QFT), calculated as: $\begin{matrix} {{\left\lbrack {QFT}_{n} \right\rbrack_{i,j} = {\frac{1}{2^{n/2}}{\mathbb{e}}^{{J{({i*j})}}\frac{2\pi}{2^{n}}}}},} & (2.11) \end{matrix}$ where: J=√{overscore (−1)}, i=0, . . . , 2^(n)−1 and, j=0, . . . , 2^(n)−1.

When n=1 then: $\begin{matrix} \begin{matrix} {\left. {QFT}_{n} \right|_{n = 1} = {\frac{1}{2^{\frac{1}{2}}}\begin{pmatrix} {\mathbb{e}}^{J*{({0*0})}2{\pi/2^{1}}} & {\mathbb{e}}^{J*{({0*1})}2{\pi/2^{1}}} \\ {\mathbb{e}}^{J*{({1*0})}2{\pi/2^{1}}} & {\mathbb{e}}^{J*{({1*1})}2{\pi/2^{1}}} \end{pmatrix}}} \\ {= {{\frac{1}{\sqrt{2}}\begin{pmatrix} 1 & 1 \\ 1 & {- 1} \end{pmatrix}} = H}} \end{matrix} & (2.12) \end{matrix}$

Eq. (2.11) can also be presented in harmonic form using the Euler formula: $\begin{matrix} {\left\lbrack {QFT}_{n} \right\rbrack_{i,j} = {\frac{1}{2^{\frac{n}{2}}}\left( {{\cos\left( {\left( {i*j} \right)\frac{2\pi}{2^{n}}} \right)} + {J\quad{\sin\left( {\left( {i*j} \right)\frac{2\pi}{2^{n}}} \right)}}} \right)}} & (2.13) \end{matrix}$

For some applications, the harmonic form of Eq (2.13) is preferable.

In general, entanglement operators are part of a QA when the information about the function being analyzed is coded as an input-output relation. Thus, it is useful to develop a general approach for coding binary functions into corresponding entanglement gates. Consider the arbitrary binary function: ƒ:

0,1

^(n)→

0,1

^(m), such that: ƒ(x₀, . . . , X_(n−1))=(y₀, . . . , y_(m−1))

In order to create unitary quantum operator, which performs the same transformation, first transform the irreversible function ƒ into a reversible function F, as follows: F:

0,1

^(m+n)→

0,1

^(m+n), such that: F(x₀, . . . , x_(n−1), y₀, . . . , y_(m−1))==(x₀, . . . , x_(n−1), ƒ(x₀, . . . , x_(n−1))⊕(y₀, . . . , Y_(m−1))) where ⊕ denotes addition modulo 2.

For the reversible function F, it is possible to design an entanglement operator matrix using the following rule: ${\left\lbrack U_{F} \right\rbrack_{i^{B},j^{B}} = {{1\quad{iff}\quad{F\left( j^{B} \right)}} = i^{B}}},i,{j \in \left\lbrack {\underset{n + m}{\underset{︸}{0,\ldots\quad,0}};\underset{n + m}{\underset{︸}{1,\ldots\quad,1}};} \right\rbrack},$ where B denotes binary coding. The resulting entanglement operator is a block diagonal matrix, of the form: $\begin{matrix} {U_{F} = \begin{pmatrix} M_{0} & \quad & 0 \\ \quad & ⋰ & \quad \\ 0 & \quad & M_{2^{n} - 1} \end{pmatrix}} & (2.14) \end{matrix}$

Each block M_(i),i=0, . . . , 2^(n)−1 includes m tensor products of I or of C operators, and can be obtained as follows: $\begin{matrix} {M_{i} = {\overset{m - 1}{\underset{k = 0}{\otimes}}\left\{ {\begin{matrix} {I,} & {{{iff}\quad{F\left( {i,k} \right)}} = 0} \\ {C,} & {{{iff}\quad{F\left( {i,k} \right)}} = 1} \end{matrix},} \right.}} & (2.15) \end{matrix}$ where C represents the NOT operator, defined as: $C = {\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.}$ The entanglement operator is a sparse matrix. Using sparse matrix operations it is possible to accelerate the simulation of the entanglement. Each row or column of the entanglement operation has only one position with non-zero value. This is a result of the reversibility of the function F.

For example, consider the entanglement operator for a binary function with two inputs and one output: ƒ:

0,1

²→

0,1

¹, such that: ƒ(x)=1|_(x=01)0|_(x≠01). The reversible function F in this case is:

F:

0,1

³→

0,1

³, such that: $\begin{matrix} \left( {x,y} \right) & \left( {x,{{f(x)} \oplus y}} \right) \\ {00\text{,}0} & {{{00\text{,}0} \oplus 0} = 0} \\ {00\text{,}1} & {{{00\text{,}0} \oplus 1} = 1} \\ {01\text{,}0} & {{{01\text{,}1} \oplus 0} = 1} \\ {01\text{,}1} & {{{01\text{,}1} \oplus 1} = 0} \\ {10\text{,}0} & {{{10\text{,}0} \oplus 0} = 0} \\ {10\text{,}1} & {{{10\text{,}1} \oplus 0} = 1} \\ {11\text{,}0} & {{{11\text{,}0} \oplus 0} = 0} \\ {11\text{,}1} & {{{11\text{,}1} \oplus 0} = 1} \end{matrix}\quad$

The corresponding entanglement block matrix can be written as:   ⟨00  ⟨01  ⟨10  ⟨11 $U_{F} = {\begin{matrix} \left. 00 \right\rangle \\ \left. 01 \right\rangle \\ \left. 10 \right\rangle \\ \left. 11 \right\rangle \end{matrix}\begin{pmatrix} I & 0 & 0 & 0 \\ 0 & C & 0 & 0 \\ 0 & 0 & I & 0 \\ {\quad 0} & {0\quad} & {0\quad} & {I\quad} \end{pmatrix}}$

FIG. 18 c shows the result of the application of this operator in Grover's QSA. Entanglement operators of Deutsch and of Deutsch-Jozsa's algorithms have the general form shown in the above equation.

As a further example, consider the entanglement operator for a binary function with two inputs and two outputs: ƒ:

0,1

²→

0,1

², such that: ƒ(x)=10|_(x=01,11)00|_(x≠01,11) and   ⟨00  ⟨01  ⟨10  ⟨11 $U_{F} = {\begin{matrix} \left. 00 \right\rangle \\ \left. 01 \right\rangle \\ \left. 10 \right\rangle \\ \left. 11 \right\rangle \end{matrix}\begin{pmatrix} {I \otimes I} & 0 & 0 & 0 \\ 0 & {C \otimes I} & 0 & 0 \\ 0 & 0 & {I \otimes I} & 0 \\ {\quad 0} & {0\quad} & {0\quad} & {C \otimes I} \end{pmatrix}}$

The entanglement operators of Shor's and of Simon's algorithms have the general form shown in the above equation.

2.4. Results of Classical QA Gate Simulation

Analyzing the quantum operators described in Section 2.2 above leads to the following simplifications for increasing the performance of classical QA simulations:

-   -   a) All quantum operators are symmetrical around main diagonal         matrices.     -   b) The state vector is a sparse matrix.     -   c) Elements of the quantum operators need not be stored, but         rather can be calculated when necessary using Eqs. (2.6),         (2.12), (2.14) and (2.15);     -   d) The termination condition can be based on the minimum of         Shannon entropy of the quantum state, calculated as:         $\begin{matrix}         {H = {- {\sum\limits_{i = 0}^{2^{m + n}}{p_{i}\log\quad p_{i}}}}} & (2.16)         \end{matrix}$

Calculation of the Shannon entropy is applied to the quantum state after the interference operation. The minimum of Shannon entropy in Eq. (2.16) corresponds to the state when there are few state vectors with high probability (states with minimum uncertainty are intelligent states).

Selection of an appropriate termination condition is important since QAs are periodical. FIG. 20 shows results of the Shannon information entropy calculation for the Grover's algorithm with 5 inputs. FIG. 20 shows that for five inputs of the Grover's QSA an optimal number of iterations, according to minimum of the Shannon entropy criteria for successful result, is exactly four. With more iterations, the probability of obtaining a correct answer will decrease and the algorithm may fail to produce a correct answer. The theoretical estimation for 5 inputs gives π/4√{overscore (2⁵)}=4.44 iterations. The Shannon entropy-based termination condition provides the number of iterations. More detailed description of the information-based termination condition is presented in Section 2.5.

Simulation results of a fast Grover QSA are summarized in Table 2.2. The number of iterations for the fast algorithm is estimated according to the termination condition based on minimum of Shannon entropy of the quantum intelligent state vector. TABLE 2.2 Temporal complexity of Grover's QSA simulation on 1.2 GHz computer with two CPUs Temporal complexity, seconds Approach 1 Approach 2 n Number of iterations h (one iteration) (h iterations) 10 25 0.28 ˜0 12 50 5.44 ˜0 14 100 99.42 ˜0 15 142 489.05 ˜0 16 201 2060.63 ˜0 20 804 — ˜0 30 25.375 — 0.016 40 853.549 — 4.263 50 26.353.589 — 12.425

The following approaches were used in the simulations listed in Table 2.2. In Approach 1, the quantum operators are applied as matrices, elements of quantum operator matrices are calculated dynamically according to Eqs. (2.6), (2.12), (2.14) and (2.15). As shown in FIG. 21, the classical hardware limit of this approach to simulation on a desktop computer is around 20 or more qubits, caused by an exponential temporal complexity.

In Approach 2, the quantum operators are replaced with classical gates. Product operations are removed from the simulation as described above in Section 2.2. The state vector of probability amplitudes is stored in compressed form (only different probability amplitudes are allocated in memory). FIG. 22 shows that with the second approach, it is possible to perform classical efficient simulation of Grover's QSA on a desktop computer with a relatively large number of inputs (50 qubits or more). FIG. 22 shows that with allocation of the state vector in computer memory, this approach permits simulation 26 qubits on a conventional PC with 1 GB of RAM. By contrast, FIG. 21 shows memory required for Grover's algorithm simulation when the entire state vector is stored in memory. Adding one qubit doubles the computer memory needed for simulation of Grover's QSA when state vector is allocated completely in memory.

2.5. Information Criteria for Solution of the QSA-Termination Problem

Quantum algorithms come in two general classes: algorithms that rely on a Fourier transform, and algorithms that rely on amplitude amplification. Typically, the algorithms includes a sequence of trials. After each trial, a measurement of the system produces a desired state with some probability determined by the amplitudes of the superposition created by the trial. Trials continue until the measurement gives a solution, so that the number of trials and hence, the running time are random.

The number of iterations needed, and the nature of the termination problem (i.e., determiming when to stop the iterations) depends in art on the information dynamics of the algorithm. An examination of the dynamics of Grover's QSA algorithm starts by preparing all m qubits of the quantum computer in the state |s>=|0 . . . 0>. An elementary rotation in the direction of the sought state |x₀> with property ƒ(x₀)=1 is achieved by the gate sequence: $\begin{matrix} {{Q = {{- \left\lbrack \underset{\underset{k\quad{times}}{︸}}{\left( {I_{s}H^{\otimes {2m}}} \right) \cdot I_{x_{0}}} \right\rbrack} \cdot H^{\otimes {2m}}}},} & (2.17) \end{matrix}$ where the phase inversion I_(s) with respect to the initial state |s> is defined by I_(s)|S>=−|S>,1_(s)|S>=|S>(x≠s). The controlled phase inversion I_(x) ₀ with respect to the sought state |x₀> is defined in an analogous way. Because the state |x₀> is not known explicitly but only implicitly through the property ƒ(x₀)=1, this transformation is performed with the help of the quantum oracle. This task can be achieved by preparing the ancillary of the quantum oracle in the state ${❘{a_{0}\text{〉}}} = {{\frac{1}{\sqrt{2}}\left( {❘0} \right\rangle} - {\text{❘}1\text{〉}\text{)}}}$ as the unitary and Hermitian transformation: U_(F):|x,a>→|x,ƒ(x)⊕a>. Thus, |x> is an arbitrary element of the computational basis and |a> is the state of an additional ancillary qubit. As a consequence, one obtains the required properties for the phase inversion I_(x) ₀ , namely: ${\left. {{\left. {{❘x},{{{{f(x)} \oplus {a_{0}\text{〉}}} \equiv}❘x},{{0 \oplus {a_{0}\text{〉}}} = {{\frac{1}{\sqrt{2}}\quad\left\lbrack {{❘x},0} \right\rangle} - {\text{❘}x}}},{1\text{〉}}} \right\rbrack = {\text{❘}x}},a_{0},{{{{for}\quad x} \neq x_{0}}❘x},{{{{f(x)} \oplus {a_{0}\text{〉}}} \equiv}❘x},{{1 \oplus {a_{0}\text{〉}}} = {{\frac{1}{\sqrt{2}}\quad\left\lbrack {{❘x},1} \right\rangle} - {\text{❘}x}}},{0\text{〉}}} \right\rbrack = {{- \text{❘}}x}},a_{0},{{{for}\quad x} \neq x_{0}}$

In order to rotate the initial state |s> into the state |x₀> one can perform a sequence of n such rotations and a final Hadamard transformation at the end, i.e., |s_(fin)>=HQ^(n)|s_(in)>. The optimal number n of repetitions of the gate Q in Eq. (2.17) is approximately given by $\begin{matrix} {{n = {{\frac{\pi}{4{\arcsin\left( 2^{- \frac{m}{2}} \right)}} - \frac{1}{2}} \approx {\frac{\pi}{4}\sqrt{2^{m}}}}},{\left( {2^{m}{\bullet 1}} \right).}} & (2.18) \end{matrix}$

The matrix D_(n), which is called the diffusion matrix of order n, is responsible for interference in this algorithm. It plays the same role as QFT_(n) (Quantum Fourier Transform) in Shor's algorithm and of ^(n)H in Deutsch-Jozsa's and Simon's algorithms. This matrix is defined as $\begin{matrix} {{\left\lbrack D_{n} \right\rbrack_{i,j} = \frac{\left( {- 1} \right)^{1\quad{{AND}{({i = j})}}}}{2^{n/2}}},} & (2.19) \end{matrix}$ where i=0, . . . , 2^(n)−1, j=0, . . . , 2^(n)−1 n is a number of inputs.

The gate equation of Grover's QSA circuit is the following: G ^(Grover)=[(D _(n) {circumflex over (×)}I)·U_(F)]^(h)·(^(n+1)H)   (2.20)

The diagonal matrix elements in Grover's QSA-operators (as shown, for example, in Eq. (2.21 ) below) are connected to a database state to itself and the off-diagonal matrix elements are connected to a database state and to its neighbors in the database. The diagonal elements of the diffusion matrix have the opposite sign from the off-diagonal elements.

The magnitudes of the off-diagonal elements are roughly equal, so it is possible to write the action of the matrix on the initial state (see Table2.3). TABLE 2.3 Diffusion matrix definition D_(n) |0 . . . 0> |0 . . . 1> . . . |i> . . . |1 . . . 0> |1 . . . 1> |0 . . . 0> −1 + 1/2^(n−1) 1/2^(n−1) . . . 1/2^(n−1) . . . 1/2^(n−1) 1/2^(n−1) |0 . . . 1> 1/2^(n−1) −1 + 1/2^(n−1) . . . 1/2^(n−1) . . . 1/2^(n−1) 1/2^(n−1) . . . . . . . . . . . . . . . . . . . . . . . . |i> 1/2^(n−1) 1/2^(n−1) . . . −1 + 1/2^(n−1) . . . 1/2^(n−1) 1/2^(n−1) . . . . . . . . . . . . . . . . . . . . . . . . |1 . . . 0> 1/2^(n−1) 1/2^(n−1) . . . 1/2^(n−1) . . . −1 + 1/2^(n−1) 1/2^(n−1) |1 . . . 1> 1/2^(n−1) 1/2^(n−1) . . . 1/2^(n−1) . . . 1/2^(n−1) −1 + 1/2^(n−1)

For example: $\begin{matrix} {{{\begin{pmatrix} {- a} & b & b & b & b & b \\ b & {- a} & b & b & b & b \\ b & b & {- a} & b & b & b \\ b & b & b & {- a} & b & b \\ b & b & b & b & {- a} & b \\ b & b & b & b & b & {- a} \end{pmatrix}\begin{pmatrix} 1 \\ 1 \\ {- 1} \\ 1 \\ 1 \\ 1 \end{pmatrix}\frac{1}{\sqrt{N}}} = {\begin{pmatrix} {{- a} + {\left( {N - 3} \right)b}} \\ {{- a} + {\left( {N - 3} \right)b}} \\ {{+ a} + {\left( {N - 1} \right)b}} \\ {{- a} + {\left( {N - 3} \right)b}} \\ {{- a} + {\left( {N - 3} \right)b}} \\ {{- a} + {\left( {N - 3} \right)b}} \end{pmatrix}\frac{1}{\sqrt{N}}}},{{{where}\quad a} = {1 - b}},{b = {\frac{1}{2^{n - 1}}.}}} & (2.21) \end{matrix}$ If one of the states is marked, i.e., has its phase reversed with respect to that of the others, the multimode interference conditions are appropriate for constructive interference to the marked state, and destructive interference to the other states. That is, the population in the marked bit is amplified. The form of this matrix is identical to that obtained through the inversion about the average procedure in Grover's QSA. This operator produces a contrast in the probability density of the final states of the database of ${\frac{1}{N}\left\lbrack {a + {\left( {N - 1} \right)b}} \right\rbrack}^{2}$ for the marked bit versus ${\frac{1}{N}\left\lbrack {a - {\left( {N - 3} \right)b}} \right\rbrack}^{2}$ for the unmarked bits; where N is the number of bits in the data register.

Grover's algorithm gate in Eq, (2.20) is optimal and it is, thus, an efficient search algorithm. Thus, software based on the Grover algorithm can be used for search routines in a large database.

Grover's QSA includes a number of trials that are repeated until a solution is found. Each trial has a predetermined number of iterations, which determines the probability of finding a solution. A quantitative measure of success in the database search problem is the reduction of the information entropy of the system following the search algorithm. Entropy S^(Sh)(P_(i)) in this example of a single marked state is defined as $\begin{matrix} {{{S^{Sh}\left( P_{i} \right)} = {- {\sum\limits_{i = 1}^{N}{P_{i}\log\quad P_{i}}}}},} & (2.22) \end{matrix}$ where P_(i) is the probability that the marked bit resides in orbital i. In general, the Von Neumann entropy is not a good measure for the usefulness of Grover's algorithm. For practically every value of entropy, there exist states that are good initializers and states that are not. For example, ${{S\left( \rho_{{({n - 1})} - {mix}} \right)} = {{{\log_{2}N} - 1} = {S\left( \rho_{{(\frac{1}{\log_{2}N})} - {pure}} \right)}}},$ but when initialized in ρ_((n−1)−mix), the Grover algorithm is not good at guessing the market state. Another example may be given using pure states H|0><0|H and H|1><1|H. With the first, Grover finds the marked state with quadratic speed-up. The second is practically unchanged by the algorithm.

The information intelligent measure ℑ_(T)(|ψ>) of the state |ψ> with respect to the qubits in T and to the basis B=

|i₁>{circle around (×)} . . . {circle around (×)}|i_(n)>

is $\begin{matrix} {{{\mathfrak{J}}_{T}\left( \left. \psi \right\rangle \right)} = {1 - {\frac{{S_{T}^{Sh}\left( \left. \psi \right\rangle \right)} - {S_{T}^{VN}\left( \left. \psi \right\rangle \right)}}{T}.}}} & (2.23) \end{matrix}$

The intelligence of the QA state is maximal if the gap between the Shannon and the Von Neumann entropy in Eq. 2.23 for the chosen resultant qubit is minimal. Information QA-intelligent measure ℑ_(T)(|ψ>) and interrelations between information measures S_(T) ^(Sh)(|ψ>)≧S_(T) ^(VN)(|ψ>) are used together with entropic relations of the step-by-step natural majorization principle for solution of the QA-termination problem. From Eq. (2.17) it can be seen that for pure states $\begin{matrix} {\left. {\max\quad{{\mathfrak{J}}_{T}\left( \left. \psi \right\rangle \right)}}\mapsto{1 - {\min\left( \frac{{S_{T}^{Sh}\left( \left. \psi \right\rangle \right)} - {S_{T}^{VN}\left( \left. \psi \right\rangle \right)}}{T} \right)}}\mapsto{\min\quad{S_{T}^{Sh}\left( \left. \psi \right\rangle \right)}} \right.,{{S_{T}^{VN}\left( \left. \psi \right\rangle \right)} = 0},} & (2.24) \end{matrix}$

From Eq.(2.17) the principle of Shannon entropy minimum is described as follows.

According to Eq. (1.2), the Shannon entropy shows the lower bound of quantum complexity of the QA. It means that the criterion in Eq. (2.24) includes both metrics for design of an intelligent QSA: (i) minimal quantum query complexity; and (ii) optimal termination of the QSA with a successful search solution.

The Shannon information entropy is used for optimization of the termination problem of Grover's QSA. A physical interpretation of the information criterion begins with an information analysis of Grover's QSA based on using of Eq. (2.23). Eq (2.23) gives a lower bound on the amount of entanglement needed for a sucessful search and of the computational time. A QSA that uses the quantum oracle calls

O_(s)

as I−2|s><s| calls the oracle at least $T \geq {\left( {\frac{1 - P_{e}}{2\pi} + \frac{1}{\pi\quad\log\quad N}} \right)\sqrt{N}}$ times to achieve a probability of error P_(e). The information system includes the N-state data register. Physically, when the data register is loaded, the information is encoded as the phase of each orbital. The orbital amplitudes carry no information. While state-selective measurement gives as result only amplitudes, the information is hidden from view, and therefore, the entropy of the system is maximum: S_(init) ^(Sh)(P_(i))=−log(1/N)=log N. The rules of quantum measurement ensure that only one state will be detected each time.

If the algorithm works perfectly, the marked state orbital is revealed with unit efficiency, and the entropy drops to zero. Otherwise, unmarked orbitals may occasionally be detected by mistake. The entropy reduction can be calculated from the probability distribution, using Eq. (2.22). The minimum Shannon entropy criteria is used for successful termination of Grover's QSA and realized in this case in digital circuit implementation. P FIG. 23 shows the results of entropy analysis for Grover's QSA according to Eq. (2.16), for the case where n=7, ƒ(x₀)=1. FIG. 23 shows that minimum Shannon entropy is achieved on the 8^(th) iteration (the minimum value of the Shannon entropy is 1). A theoretical estimation for this case is ${\frac{\pi}{4}\sqrt{2^{7}}} \approx 9$ iterations. On the ninth iteration, the probability of the correct answer already becomes smaller, and as a result, measurement of the wrong basis vector may happen.

Application of the Shannon entropy termination condition is presented below in Section 6 (see FIGS. 48 and 49) for different input qubit numbers of Grover's QSA. The role of majorization and its relationship to Shannon entropy is discussed below.

Majorization describes what it means to say that one probability distribution is more disordered than another. In the quantum mechanical context, majorization provides an elegant way to compare two probability distributions or two density matrices. The step-by-step majorization is found in the known instance of efficient QA's, namely in the QFT, in Grover's QSA, in Shor's QA, in the hidden affine function problem, in searching by quantum adiabatic evolution and in deterministic quantum walks algorithm in continuous time solving a classical hard problem. Moreover, majorization has found many applications in classical computer science like stochastic scheduling, optimal Huffman coding, greedy algorithms, etc. Majorization is a natural ordering on probability distributions. One probability distribution is more uneven than another one when the former majorizes the later. Majorization implies an entropy decrease, thus the ordering concept introduced by majorization is more restrictive and powerful than that associated with the Shannon entropy.

The notion of ordering from majorization is more severe than the one quantified by the standard Shannon entropy. If one probability distribution majorizes another, a set of inequalities must hold to constrain the former probabilities with respect to the latter. These inequalities lead to entropy ordering, but the converse is not necessarily true. In quantum mechanics, majorization is at the heart of the solution of a large number of quantum information problems. In QA analysis, the problem distribution associated with the quantum state in the computational basis is step-by-step majorized until it is maximally ordered. Then a measurement provides the solution with high probability. The way such a detailed majorization emerges in both algorithmic families (as Grover's and Shor's QA's, and phase-estimation QA) is intrinsically different. The analyzed instance of QA's support a step-by-step Majorization Principle.

Grover's algorithm is an instance of the principle where majorization works step by step until the optimal target state is found. Extensions of this situation are also found in algorithms based in quantum adiabatic evolution and the family of quantum phase-estimation algorithms, including Shor's algorithm. In a QA, the time arrow is a majorization arrow.

Majorization is often defined as a binary relation noted by

on vectors in

^(d). Notations are fixed by introducing the following basic definitions:

For x,y ε

^(d), $x \prec {y\quad{iff}\left\{ \begin{matrix} {{{\sum\limits_{i = 1}^{k}x_{\lbrack i\rbrack}} \leq {\sum\limits_{i = 1}^{k}y_{\lbrack i\rbrack}}},} & {{k = 1},\ldots\quad,{d - 1}} \\ {{\sum\limits_{i = 1}^{d}x_{\lbrack i\rbrack}} \leq {\sum\limits_{i = 1}^{d}y_{\lbrack i\rbrack}}} & \quad \end{matrix} \right.}$ where [z_([1]) . . . z_([d])]:=sort_(↓) (z) denotes the descendingly sorted (non-increasing) ordering of zε

^(d). If it exists, the least element x₁ (greatest element x_(g)) of a partial order like majorization is defined by the condition x₁

x, ∀xε

^(d)(x

x_(g), ∀x ε

^(d))

For example, consider two vectors x, y εR^(d) such that ${{\sum\limits_{i = 1}^{d}x_{i}} = {{\sum\limits_{i = 1}^{d}y_{i}} = 1}},$

whose components represent two different probabilistic distributions. Three definitions of majorization are given in the table below: Definition 1 $x = {\sum\limits_{j}{p_{j}P_{j}y}}$ Definition 2 ${{\sum\limits_{i = 1}^{k}x_{i}} \leq {\sum\limits_{i = 1}^{k}y_{i}}},\quad{k = 1},\ldots,d$ Definition 3 x = Dy

Definition 1 says that distribution y majorizes distribution x, written x

y, if and only if, there exists a set of permutation matrices P_(j) and probabilities p_(j) such that $x = {\sum\limits_{j}{p_{j}P_{j}{y.}}}$

Because the probability distribution x can be obtained from y by means of a probabilistic sum, the definition given above provides the intuitive notion that the x distribution is more disordered than y.

An alternative and usually more practical definition of majorization can be stated in terms of a set of inequalities to be held between two distributions as described in Definition 2 above. Consider the components of the two vectors sorted in decreasing order, written as (z₁, . . . z_(d))≡z^(↓). Then, y^(↓) majorizes x^(↓) if and only if the following relations are satisfied: ${{\sum\limits_{i = 1}^{k}x_{i}} \leq {\sum\limits_{i = 1}^{k}y_{i}}},\quad{k = 1},\ldots\quad,{d.}$

Probability sums, such as the ones appearing in the previous expression are referred to as “cumulants”.

According to Definition 3 above, a real d×d matrix D=(D_(ij)) is said to be double stochastic if it has non-negative entries, and each row and column of D sums to 1. Then y majorizes x if and only if, there is a double stochastic matrix D such that x=Dy. Complementarily, the probability distribution x minorizes distribution y if and only if, y majorizes x.

A powerful relation involving majorization and common Shannon entropy ${S^{Sh}(x)} = {- {\sum\limits_{i = 1}^{d}{x_{i}\quad\log\quad x_{i}}}}$ of probability distribution x is that: If x

y, then −S^(Sh)(y)≧−S^(Sh)(x). This is a particular case of a more general result, stated in the following weak form: $\left. {x \prec y}\Rightarrow{{F(x)} < {F(y)}} \right.,\quad{{{where}\quad{F(x)}} \equiv {\sum\limits_{i}{f\left( x_{i} \right)}}},$ for any convex function ƒ:R→R This result can be extended to the domain of operator functionals. $\left. {\rho \prec \sigma}\Rightarrow{{F(\rho)} < {F(\sigma)}} \right.,{{{{wher}e}\quad{F(\rho)}} \equiv {\sum\limits_{i}{f\left( \lambda_{i} \right)}}},$ and λ_(i) are the eigenvalues of ρ, for any convex function ƒ:R→R

In particular, it follows that the von Neumann entropy S^(vN)(ρ)=S^(Sh)(λ(ρ)) also obeys ρ

σ

−S^(vN)(ρ)≦−S^(vN)(σ).

Thus, if one probability distribution or one density operator is more disordered than another in the sense of majorization, then it is also more disordered according to the Shannon or the von Neumann entropies, respectively.

As the two previous theorems show, there are many other functions that also preserve the majorization relation. Any such function, called Schur-convex, can in a sense be used as a measure of order. The majorization relation is a stronger notion of disorder, giving more information than any Schur-convex function. The Shannon and the von Neumann entropies quantify the order in some limiting conditions, namely when many copies of a system are considered.

There is a majorization principle underlying the way QA's work. Denote by |Ψ_(m)> the pure state representing the state of the register in a quantum computer at an operating stage labeled by m=0,1, . . . , M−1, where M is the total number of steps of algorithm, and let N be the dimension of the Hilbert space. Also, denote as

|i>

_(i=1) ^(N) the basis in which the final measurement is performed in the algorithm, one can naturally associate a set of sorted probabilities [p^(m) _([x])], x=0,1, . . . ,2^(n)−1 to this quantum state of n qubits in the following way: decompose the register state in the computational basis i.e., |Ψ_(m)>:=Σ_(x=0) ² ^(n) ⁻¹c^(m) _(x)|x> with

|x>:=|x₀s₁ . . . x_(n−1)>

_(x=0) ² ^(n) ⁻¹ denoting basis states in digital or binary notation, respectively and x:=Σ_(j=0) ^(n−1)x_(j)2^(j).

The sorted vectors to which majorization theory applies are precisely [p ^(m) _([x])]:=[|c^(m) _([x])|²]=[|<x|ψ_(m)>|²], where x=1, . . . , N, which corresponds to the probabilities of all the possible outcomes if the computation is stopped at stage m and a measurement is performed.

Thus, in a QA, one deals with probability densities defined in

₊ ^(d), with d=2^(n). With these ingredients, the main result can be stated as follows: in the QAs known so far, the set of sorted probabilities [p_([x]) ^(m)] associated with the quantum register at each step m are majorized by the corresponding probabilities of the next step ${\left\lbrack p_{\lbrack x\rbrack}^{m} \right\rbrack \prec \left\lbrack p_{\lbrack x\rbrack}^{m + 1} \right\rbrack},\left\{ {\begin{matrix} {{{\forall m} = 0},1,\ldots\quad,{M - 2}} \\ {{x = 0},1,\ldots\quad,{2^{n} - 1}} \end{matrix},{{{or}p^{(m)}} \prec p^{({m + 1})}},{p^{(m)} = {\left\lbrack p_{\lbrack x\rbrack}^{m} \right\rbrack.}}} \right.$

Majorization works locally in a QA, i.e., step by step, and not just globally (for the initial and final states). The situation given in the above equation is a step-by-step verification, as there is a net flow of probability directed to the values of highest weight, in such a way that the probability distribution will be steeper as time flows.

In physical terms, this can be stated as a very particular constructive interference behavior, namely, a constructive interference that has to satisfy the constraints given above step-by-step. The QA builds up the solution at each time step by means of this very precise reordering of probability distribution.

The majorization is checked on a particular basis. Step-by-step majorization is a basis-dependent concept. The preferred basis is the basis defined by the physical implementation of the quantum computation or computational basis. The principle is rooted in the physical possibility to arbitrarily stop the computation at any time and perform a measurement. The probability distribution associated with this physically meaningful action obeys majorization and the QA-stopping problem can be solved by the principle of minimum of Shannon entropy.

Working with probability amplitudes in the basis

|i>

_(i=1) ^(N), the action of a particular unitary gate at step m makes the amplitudes evolve to step m+1 in the following way: ${c_{i}^{m + 1} = {\sum\limits_{j = 1}^{N}{U_{ij}c_{j}^{m}}}},$ where U_(ij) are the matrix elements in the chosen basis of the unitary evolution operator (namely, the propagator from step m to step m+1 ). Inverting the evolution gives ${c_{i}^{m} = {\sum\limits_{j = 1}^{N}{A_{ij}c_{j}^{m + 1}}}},$ where A_(ij) are the matrix elements of the inverse unitary evolution (which is unitary as well).Taking the square modulus ${c_{i}^{m}}^{2} = {{\sum\limits_{j}{{A_{ij}}^{2}{c_{i}^{m + 1}}^{2}}} + {{interference}\quad{{terms}.}}}$

Should the interference terms disappear, majorization would be verified in a “natural” way between steps m and m+1 because the initial probability distribution could be obtained from the final one only by the action of a doubly stochastic matrix with entries |A_(ij)|². This is so-called “natural majorization”: majorization, which naturally emerges from the unitary evolution due to the lack of interference terms when making the square modulus of the probability amplitudes. There will be “natural minorization” between steps m and m+1 if and only if there is “natural majorization” between time steps m+1 and m.

Grover's QSA follows a step-by-step majorization. More concretely, each time Grover's operator is applied, the probability distribution obtained from the computational basis obeys the above constraints until the searched state is found. Furthermore, because of the possibility of understanding Grover's quantum evolution as a rotation in a two-dimensional Hilbert space the QA follows a step-by-step minorization when evolving far away from the marked state, until the initial superposition of all possible computational states is obtained again. The QA behaves such that majorization is present when approaching the solution, while minorization appears when escaping from it. A cycle of majorization and minorization emerges in the process proceeds through enough evolutions, due to the rotational nature of Grover's operator.

Grover's algorithm is an instance of the principle where majorization works step-by-step until the optimal target state is found. Extensions of this situation are also found in algorithms based in quantum adiabatic evolution and the family of quantum phase-estimation algorithms, including Shor's algorithm.

Grover's algorithm can conveniently be used as a starting point for majorization analysis of various quantum algorithms. This QA efficiently solves the problem of finding a target item in a large database. The algorithm is based on a kernel that acts symmetrically on the subspace orthogonal to the solution. This is clear from its construction K:=U_(s)U_(y0) U _(s):=2|s><s|−1, U _(y0):=1−2|y ₀ ><y _(0|) where |s>:=1/√{overscore (N)}Σ_(x)|x> and |y₀> is a searched item. The set of probabilities to obtain any of the N possible states in a database is majorized step-by-step along with the evolution of Grover's algorithm when starting from a symmetric state until the maximum probability of success is reached.

Shor's QA is analyzed inside of the broad family of quantum phase-estimation algorithms. A step-by-step majorization appears under the action of the last QFT when considered in the usual Coppersmith decomposition. The result relies on the fact that those quantum states that can be mixed by a Hadamard operator coming from the decomposition of the QFT only differ by a phase all along the computation. Such a property entails as well the appearance of natural majorization, in the way presented above. Natural majorization is relevant for the case of Shor's QFT. This particular algorithm manages step-by-step majorization in the most efficient way. No interference terms spoil the majorization introduced by the natural diagonal terms in the unitary evolution.

For efficient termination of QAs that give the highest probability of successful result, the Shannon entropy is minimal for the step m+1. This is the principle of minimum Shannon entropy for termination of a QA with the successful result. This result also follows from the principle of QA maximum intelligent state. For this case: ${{\max\quad{J_{T}\left( \left. \psi \right\rangle \right)}} = {1 - {\min\quad\frac{\left. \left. {H_{T}^{Sh}\left( {\psi} \right.} \right\rangle \right)}{T}}}},$ S_(T) ^(vN)(|ψ>)=0 (for pure quantum state). Thus, the principle of maximal intelligence of QAs include as particular case the principle of minimum Shannon entropy for QA-termination problem solution. 3. The Structure and Acceleration Method of Quantum Algorithm Simulation

The analysis of the quantum operator matrices that was carried out in the previous sections forms the basis for specifying the structural patterns giving the background for the algorithmic approach to QA modeling on classical computers. The allocation in the computer memory of only a fixed set of tabulated (pre-defined) constant values instead of allocation of huge matrices (even in sparse form) provides computational efficiency. Various elements of the quantum operator matrix can be obtained by application of an appropriate algorithm based on the structural patterns and particular properties of the equations that define the matrix elements. Each representation algorithm uses a set of table values for calculating the matrix elements. The calculation of the tables of the predefined values can be done as part of the algorithm's initialization.

3.1. Algorithmic Representation of the Grover's QA

FIGS. 24 a-c are flowcharts showing realization of such an approach for simulation of superposition (FIG. 24 a), entanglement (FIG. 24 b) and interference (FIG. 24 c) operators in Grover's QSA. Here n is a number of qubit, i and j are the indexes of a requested element, hc=2^(−(n+1)/2), dc1=2^(1−n)−1 and dc2=2^(1−n) are the table values.

In FIG. 24 a, in a block 2401, the i,j values are specified and provided to an initialization block 2402 where loops control variables ii :=i, jj:=0, and k:=0 are initialized, and calculation variable h:=1 is initialized. The process then proceeds to a decision block 2403. In the block 2403, if k is less than or equal to n, then the process advances to a decision block 2404; otherwise, the process advances to an output block 2407 where the output h*hc is computed (where hc=2^(−(n+1)/2)). In the decision block 2404, if (ii and jj and 1)=1, then the process advances to a block 2406; otherwise, the process advances to a block 2405. In the block 2406, the process sets h:=−h and advances to the block 2405. In the block 2405, the process sets ii:=ii SHR 1, jj:=jj SHR 1, and k:=k+1 (where SHR is a shift right operation), and then the process returns to the decision block 2403.

In FIG. 24 b, the inputs i, j in an input block 2411 are provided to an initialization block 2412 which sets ii:=i SHR 1, and jj:=SHR 1 and then advances to a decision block 2413. In the decision block 2413, if ii==jj, then the process advances to a decision block 2415, otherwise, the process advances to an output block 2414 which outputs 0. In the decision block 2415, if i=j, then the process advances to a block 2416; otherwise, the process advances to a block 2417. In the block 2416, the process sets u:=1 and then advances to a decision block 2418. In the block 2417, the process sets u:=0 and advances to the decision block 2418. In the decision block 2418, if f(ii)=1, then the process advances to a block 2420; otherwise, the process advances to an output block that outputs u. The block 2420 sets u:=NOT u and advances to the output block 2419.

In FIG. 24 c, if ((i XOR j) AND 1)=1 then the process outputs 0; otherwise, the process advances to a decision block 2423. In the decision block 2423, if i=j then the process outputs dc1, otherwise the process outputs dc2, where dc1=2^(1−n)−1 and dc2=2^(1−n).

As described above, the superposition and entanglement operators for Deutsch-Jozsa's QA are the same with superposition and entanglement operators for Grover's QSA (FIG. 24 a, FIG. 24 b, respectively). The interference operator representation algorithm for Deutsch-Jozsa's QA is shown in FIG. 24 d, where hc=2^(−n/2).

The entanglement operator for the Simon QA is shown in FIG. 24 e. Here m is an output dimension, ec1=2^(m)−1 and ec2=2^(m−1) are the table values. In FIG. 24 e, the inputs i,j are provided to an initialization block 2452 that sets ii:=i SHR m and jj :=SHR m. The process then advances to a decision block 2453. In the decision block 2453, if ii=jj then the process advances to a block 2454; otherwise, the process outputs 0. In the block 2454, the process sets u:=f(ii), ii:=i AND ec1, jj:=j AND ec1, and k:=ec2; after which the process advances to a decision block 2455. In the decision block 2455, if (u AND k)=0, then the process advances to a decision block 2456; otherwise, the process advances to a decision block 2457. In the decision block 2456, if k<=ii, and k>jj, then the process outputs 0; otherwise, the process advances to a decision block 2451. In the decision block 2457, if k<=ii AND k<=jj, then the process outputs 0; otherwise, the process advances to a decision block 2456. In the decision block 2451, if k>ii AND k<=jj, then the process outputs 0; otherwise, the process advances to a block 2459. In the decision block 2456, if k>ii AND k>jj then the process outputs 0; otherwise, the process advances to the block 2459. In the block 2459, the process sets ii:=jj AND (k−1), jj:=jj AND (k=1), and k:=K SHR 1, after which, the process advances to a decision block 2458. In the decision block 2458, if k>0, then the process loops back to the block 2455; otherwise, the process outputs 1.

Superposition and interference operators for the Simon QA are identical (see Table 2.1) and are shown by flowchart in FIG. 24 f. In FIG. 24 f, the inputs i,j are provided to a decision block 2552. In the decision block 2552, if ((i XOR j) AND (2^(n−1))=0) then the process advances to a block 2553; otherwise, the process outputs 0. In the block 2553, the process sets ii:=i SHR n, jj :=j SHR n, h:=1, and k:=1, and then advances to a decision block 2556. In the decision block 2556, if k<=n, then the process advances to a decision block 2557; otherwise, the process outputs h*hc. In the decision block 2557, if (((ii AND jj) AND 1)=1) then the process sets J:=−h and advances to a block 2558; otherwise, the process advances directly to the block 2558. In the block 2558, the process sets ii:=SHR 1, jj :=jj SHR 1, k:=k+1 and then loops back to the decision block 2556.

FIG. 24 g is a flowchart showing calculation of the interference operator from the Shor QA. The Shor interference operator is relatively more complex, as explained above. Superposition and entanglement operators for the Shor algorithm are the same as the Simon's QA operators shown in FIG. 24 f and FIG. 24 e. The Shor interference operator is based on the Quantum Fourier Transformation (QFT) with table values c1=2^(−n/2) and c2=π/2^(n−1).

In FIG. 24 g, the inputs i,j are provided to a decision block 2602. In the decision block 2602, if ((i XOR j) AND (2^(n)−1))=0 then the process advances to a block 2603; otherwise, the process outputs the complex number (0,0). In the block 2603, the process sets i:=i SHR n, and j :=j SHR n, and then advances to a decision block 2604. In the decision block 2604, if i=0, then the process outputs the complex number (c1,0); otherwise, the process advances to a decision block 2607. In the decision block 2607, if j=0, then the process outputs the complex number (c1,0); otherwise, the process advances to a block 2608, In the block 2608, the process sets a:=c1*cos(i*j*c2), and b:=c1*sin(i*j*c2), and the outputs (a,b).

The time required for calculating the elements of an operator's matrix during a process of applying a quantum operator is generally small in comparison to the total time of performing a quantum step. Thus, the time burden created by exponentially-increasing memory usage tends to be less, or at least similar to, the time burden created by computing matrix elements as needed. Moreover, since the algorithms used to compute the matrix elements tend to be based on fast bit-wise logic operations, the algorithms are amenable to hardware acceleration.

Table 3.1 shows comparisons of the traditional and as-needed matrix calculation (when the memory used for the as-needed algorithm (Memory*) denotes memory used for storing the quantum system state vector. TABLE 3.1 Different approaches comparison: Standard (matrix based) and algorithmic based approach Standard Calculated Matrices Qubits Memory, MB Time, s Memory* Time, s 1 1 0.03 ≈0 ≈0 8 18 5.4 0.008 0.0325 11 1048 1411 0.064 2.3 16 — — 2 4573 24 — — 512 3 * 10⁸ 64 — — — —

The results shown in Table 3.1 is based on the results of testing the software realization of Grover QSA simulator on a personal computer with Intel Pentium III 1 GHz processor and 512 Mbytes of memory. One iteration of the Grover QSA was performed.

Table 3.1 shows that significant speed-up is achieved by using the algorithmic approach as compared with the prior art direct matrix approach. The use of algorithms for providing the matrix elements allows considerable optimization of the software, including the ability to optimize at the machine instructions level. However, as the number of qubits increases, there is an exponential increase in temporal complexity, which manifests itself as an increase in time required for matrix product calculations.

Use of the structural patterns in the quantum system state vector and use of a problem-oriented approach for each particular algorithm can be used to offset this increase in temporal complexity. By way of explanation, and not by way of limitation, the Grover algorithm is used below to explain the problem-oriented approach to simulating a QA on a classical computer.

3.2. Problem-Oriented Approach Based on Structural Pattern of QA State Vector.

Let n be the input number of qubits. In the Grover algorithm, half of all 2^(n−1) elements of a vector making up its even components always take values symmetrical to appropriate odd components and, therefore, need not be computed. Odd 2^(n) elements can be classified into two categories:

The set of m elements corresponding to truth points of input function (or oracle); and

The remaining 2^(n)−m elements.

The values of elements of the same category are always equal.

As discussed above, the Grover QA only requires two variables for storing values of the elements. Its limitation in this sense depends only on a computer representation of the floating-point numbers used for the state vector probability amplitudes. For a double-precision software realization of the state vector representation algorithm, the upper reachable limit of q-bit number is approximately 1024. FIG. 25 shows a state vector representation algorithm for the Grover QA. In FIG. 25, i is an element index, ƒ is an input function, vx and va corresponds to the elements' category, and v is a temporal variable. The input i is provided to a decision block 2502. In the decision block 2502, if ƒ(i SHR 1)=1, then the process proceeds to a block 2503; otherwise, the process proceeds to a block 2507. In the block 2503, the process sets v:=vx and then advances to a decision block 2504. In the block 2507, the process sets v:=va and then advances to the decision block 2504. In the decision block 2504, if (i AND 1)=1), then the process outputs −v; otherwise, the process outputs v. Thus, the number of variables used for representing the state variable is constant.

A constant number of variables for state vector representation allows reconsideration of the traditional schema of quantum search simulation. Classical gates are used not for the simulation of appropriate quantum operators with strict one-to-one correspondence but for the simulation of a quantum step that changes the system state. Matrix product operations are replaced by arithmetic operations with a fixed number of parameters irrespective of qubit number.

FIG. 26 shows a generalized schema for efficient simulation of the Grover QA built upon three blocks, a superposition block H 2602, a quantum step block UD 2610 and a termination block T 2605. FIG. 26 also shows an input block 2601 and an output block 2607. The UD block 2610 includes a U block 2603 and a D block 2604. The input state from the input block 2601 is provided to the superposition block 2602. A superposition of states from the superposition block 2602 is provided to the U block 2603. An output from the U block 2603 is provided to the D block 2604. An output from the D block 2604 is provided to the termination block 2605. If the termination block terminates the iterations, then the state is passed to the output block 2607; otherwise, the state vector is returned to the U block 2603 for another iteration.

As shown in FIG. 27, the superposition block H 2602 for Grover QSA simulation changes the system state to the state obtained traditionally by using n+1 times the tensor product of Walsh-Hadamard transformations. In the process shown in FIG. 27, vx:=hc, va:=hc, and vi:=0., where hc=2^(−(n+1)/2) is a table value.

The quantum step block UD 2610 that emulates the entanglement and interference operators is shown on FIGS. 28 a-c. The UD block 2610 reduces of the temporal complexity of the quantum algorithm simulation to linear dependence on the number of executed iterations. The UD block 2610 uses ore-calculated table values dc1=2^(n)−m and dc2=2^(n−1). In the U block 2603 shown in FIG. 28 a, vx:=−vx and vi:=vi+1. In the D block 2604 shown in FIG. 28 b, v:=m*vx+dc1*va, v:=v/dc2, vx:=v=vx, and va:=v−va in the UD block shown in FIG. 28 c, v:=dc1*va=m*vx, v:=v/dc2, vx:=v+vx, va:=v−va, and vi:=vi+1.

The termination block T 2605 is general for all quantum algorithms, independently of the operator matrix realization. Block T 2605 provides intelligent termination condition for the search process. Thus, the block T 2605 controls the number of iterations through the block UD 2610 by providing enough iterations to achieve a high probability of arriving at a correct answer to the search problem. The block T 2605 uses a rule based on observing the changing of the vector element values according to two classification categories. The T block 2605 during a number of iterations, watches for values of elements of the same category monotonically increase or decrease while values of elements of another category changed monotonically in reverse direction. If after some number of iteration the direction is changed, it means that an extremum point corresponding to a state with maximum or minimum uncertainty is passed. The process can proceed here using direct values of amplitudes instead of considering Shannon entropy value, thus, significantly reducing the required number of calculations for determining the minimum uncertainty state that guarantees the high probability of a correct answer. The Termination algorithm realized in the block T 2605 can use one or more of five different termination models:

-   -   Model 1: Stop after a predefined number of iterations;     -   Model 2: Stop on the first local entropy minimum;     -   Model 3: Stop on the lowest entropy within a predefined number         of iterations;     -   Model 4: Stop on a predefined level of acceptable entropy;         and/or     -   Model 5: Stop on the acceptable level or lowest reachable         entropy within the predefined number of iterations.

Note that models 1-3 do not require the calculation of an entropy value. FIGS. 29-31 show the structure of the termination condition blocks T 2605.

Since time efficiency is one of the major demands on such termination condition algorithm, each part of the termination algorithm is represented by a separate module, and before the termination algorithm starts, links are built between the modules in correspondence to the selected termination model by initializing the appropriate functions' calls.

Table 3.2 shows components for the termination condition block T 2605 for the various models. Flow charts of the termination condition building blocks are provided in FIGS. 29-34 TABLE 3.2 Termination block construction Model T B′ C′ 1 A — — 2 B PUSH — 3 C A B 4 D — — 5 C A E

The entries A, B, PUSH, C, D, E, and PUSH in Table 5 correspond to the flowcharts in FIGS. 29, 30, 31, 32, 33, 34 respectively.

In model 1, only one test after each application of quantum step block UD is needed. This test is performed by block A. So, the initialization includes assuming A to be T, i.e., function calls to T are addressed to block A. Block A is shown in FIG. 29. As shown in FIG. 29, the A block checks to see if the maximum number of iterations has been reached, if so, then the simulation is terminated, otherwise, the simulation continues.

In model 2, the simulation is stopped when the direction of modification of categories' values are changed. Model 2 uses comparison of the current value of vx category with value mvx that represents this category value obtained in previous iteration:

-   -   (i) If vx is greater than mvx, its value is stored in mvx, the         vi value is stored in mvi, and the termination block proceeding         to the next quantum step.     -   (ii) If vx is less than mvx, it means that the vx maximum is         passed and the process needs to set the current (final) value of         vx :=o mvx, vi :=mvi, and stop the iteration process. So, the         process stores the maximum of vx in mvx and the appropriate         iteration number vi in mvi. Here block B, shown in FIG. 30 is         used as the main block of the termination process. The block         PUSH, shown in the FIG. 31 a is used for performing the         comparison and for storing the vx value in mvx (case a). A POP         block, shown in FIG. 31 b is used for restoring the mvx value         (case b). In the PUSH block of FIG. 31 a, if |vx|>|mvx|, then         mvx:=vx, mva:=va, mvi:=vi, and the block returns true;         otherwise, the block returns false. In the POP block of FIG. 31         b, if |vx|<=|mvx|, then vx:=mvx, va:=mva, and vi:=mvi.

The model 3 termination block checks to see that a predefined number of iterations is not exceeded (using block A in FIG. 29):

-   -   (i) If the check is successful, then the termination block         compares the current value of vx with mvx. If mvx is less than,         it sets the value of mvx equal to vx and the value of mvi equal         to vi. If mvx is less using the PUSH block, then perform the         next quantum step.     -   (ii) If the check operation fails, then (if needed) the final         value of vx equal to mvx, vi equal to mvi (using the POP block)         and the iterations are stopped.

The model 4 termination block uses a single component block D, shown in FIG. 33. The D block compares the current Shannon entropy value with a predefined acceptable level. If the current Shannon entropy is less than the acceptable level, then the iteration process is stopped; otherwise, the iterations continue.

The model 5 termination block uses the A block to check that a predefined number of iterations is not exceeded. If the maximum number is exceeded, then the iterations are stopped. Otherwise, the D block is then used to compare the current value of the Shannon entropy with the predefined acceptable level. If acceptable level is not attained, then the PUSH block is called and the iterations continue. If the last iteration was performed, the POP block is called to restore the vx category maximum and appropriate vi number and the iterations are ended.

FIG. 35 shows measurement of the final amplitudes in the output state to determine the success or failure of the search. If |vx|>|va|, then the search was successful; otherwise, the search was not successful.

Table 3.3 lists results of testing the optimized version of Grover QSA simulator on personal computer with Pentium 4 processor at 2 GHz. TABLE 3.3 High probability answers for Grover QSA Qbits Iterations Time 32 51471 0.007 36 205887 0.018 40 823549 0.077 44 3294198 0.367 48 13176794 1.385 52 52707178 5.267 56 210828712 20.308 60 843314834 81.529 64 3373259064 328.274

The theoretical boundary of this approach is not the number of qubits, but the representation of the floating-point numbers. The practical bound is limited by the front side bus frequency of the personal computer.

Using the above algorithm, a simulation of a 1000 qubit Grover QSA requires only 96 seconds for 10⁸ iterations.

The above approach can be used for simulation of the Deutsch-Jozsa's QA. The general schema of Deutsch-Jozsa's QA simulation is shown on FIG. 36, where an input state 3601 is provided to a quantum HUD block 3602 which generates an output state 3603.

The structure of the HUD block 3602 is shown in FIG. 37, where the input 3601 is provided to an initialization block 3702. The initialization block 3702 sets i:=0 and v:=0, and then the process advances to a decision block 3703. In the decision block 3703, if i<2^(n), then the process advances to a decision block 3704; otherwise, the process advances to an output block which outputs v:=v*vc, where vc=2^(−n−1/2).

The quantum block HUD 2610 is applied only once to obtaining of the final state. Here v represents the vector |0..00> amplitude, ƒ is an input function of order n, vc=2^(−n−1/2) is a table value. After applying the block HUD, the value of v is considered in correspondence with Table 3.4. TABLE 3.4 Possible answers for Deutsch-Jozsa's problem Value of v Answer 0 f is balanced $\frac{1}{\sqrt{2}}$ f is constant 0 $- \frac{1}{\sqrt{2}}$ f is constant 1 Otherwise f is something else 4. General Software and Hardware Approach in QC Based on Fast Algorithm Simulation

The structure of the generalized approach in QA simulation is shown in FIG. 39. From the available database of the QAs, its matrix representation is extracted. Then matrix operators are replaced with developed algorithmic or problem-oriented corresponding approaches, thus spatio-temporal characteristics of the algorithm will improve.

The simulation is then performed, and after obtaining final state vector, the measurement takes place in order to extract the result. Final results can be obtained by having the information about the algorithm and results of the measurement. After interpretation, results can be applied in the selected field of applications.

5. Simulation of Quantum Algorithms with Reduced Number of Quantum Operators: Application of Entanglement-Free Quantum Control Algorithm for Robust KB Design of FC

The simulation techniques described above for simulating quantum algorithms on classical computers permit design of new QAs, such as, for example, entanglement-free quantum control algorithms. The simulation of a QA can be made more efficient by arranging the QA to be entanglement-free. In one embodiment, the entanglement-free algorithm is used in the context of soft computing optimization for the design process of a robust Knowledge Base (KB) for a Fuzzy Controller (FC).

5.1. Models of Entanglement-Free Algorithms and Classical Efficient Simulation of Quantum Strategies without Entanglement.

Entanglement-free quantum speed-up algorithms are useful for many applications, including, but not limited to, simulation results in the robust KB-FC design process. The explanation of the entanglement-free quantum efficient algorithm begins with a statement of the following problem: Given an integer N function ƒ: x→mx+b, where x, m,b εZ_(N), find m. The classical analysis reveals that no information about m can be obtained with only one evolution of the function ƒ. Conversely, given the unitary operator U_(ƒ) acting in a reversible way in the Hilbert space Hil_(N){circle around (×)}Hil_(N) such that U _(ƒ)|x>|y>=|x>|y+ƒ(x)>,   (5.1) (where the sum is to be interpreted as modulus N). A QA can be used to solve this problem with only one query to U_(ƒ).

A QA structure for solving the above problem is described as follows. Take N=2^(n), being n the number of qubits. The QA for efficiently solving the above problem includes the following operations:

-   -   1. Prepare two registers of n qubits in the state |0 . . .         >|ψ₁>εH_(N){circle around (×)}H_(N), where |ψ₁>=QFT(N)⁻¹|1>, and         QFT(N)⁻¹ denotes the inverse quantum Fourier transform in a         Hilbert space of dimension N.     -   2. Apply QFT (N) over the first register.     -   3. Apply U_(ƒ) over the whole quantum state.     -   4. Apply QFT(N)⁻¹ over the first register.     -   5. Measure the first register and output the measured value.

This QA leads to the solution of the problem. The analysis raises two observations concerning the way both entanglement and majorization behave in the computational process. In the first step of the algorithm, the quantum state is separable, noting that the QFT (and its inverse) are applied on a well-defined state in the computational basis leads to a perfectly separable state. Actually, this separability holds also step-by-step when the decomposition for the QFT is considered, such as the Coppersmith's decomposition. That is, the quantum state |0 . . . 0>|ψ₁> is un-entangled.

The second step of the algorithm corresponds to a QFT in the first register. This action leads to a step-by-step minorization of the probability distribution of the possible outcomes while it does not create any entanglement. Moreover, natural minorization is at work due to the absence of interference terms.

It can be verified that the quantum state $\begin{matrix} {\left. \psi_{1} \right\rangle = {\frac{1}{\sqrt{N}}{\sum\limits_{j = 0}^{N - 1}{{\mathbb{e}}^{{- 2}\pi\quad\frac{i}{N}}\left. j \right\rangle}}}} & (5.2) \end{matrix}$ is an eigenstate of the operator |y>→|y+ƒ(x)) with eigenvalue e^(2πiƒ(x)/N).

After the third step, the quantum state reads $\begin{matrix} {{\frac{1}{\sqrt{N}}{\sum\limits_{j = 0}^{N - 1}{{\mathbb{e}}^{2\pi\quad i\frac{f{(x)}}{N}}\left. \psi_{1} \right\rangle}}} = {\frac{{\mathbb{e}}^{2\pi\quad i\frac{b}{N}}}{\sqrt{N}}\underset{{First}\quad{Register}}{\underset{︸}{\left( {\sum\limits_{x = 0}^{N - 1}{\mathbb{e}}^{2\pi\quad i\frac{mx}{N}}} \right)}}\left. \psi_{1} \right\rangle}} & (5.3) \end{matrix}$

The probability distribution of possible outcomes has not been modified, thus not affecting majorization. Furthermore, the pure quantum state of the first register in Eq.(5.3) can be written as QFT (N) m) (up to a phase factor), so this step has not created any entanglement among the qubits of the system.

In the fourth step of the algorithm, the action of the operator QFT(N)⁻¹ over the first register leads to the state e^(2πib/N)|m>|ψ₁>.

A subsequent measurement in the computational basis over the first register provides the desired solution.

The inverse QFT naturally majorizes step-by-step the probability distribution attached to the different outputs. However, the separability of the quantum state still holds step-by-step.

The QA is more efficient than any of its possible classical counterparts, as it only needs a single query to the unitary operator U_(ƒ) to obtain the solution. One can summarize this analysis of majorization for the present QA as follows: The entanglement-free efficient QA for finding a hidden affine function shows a majorization cycle based on the action of QFT(N) and QFT(N)⁻¹.

It follows that there can exist a quantum computational speed-up without the use of entanglement. In this case, no resource increases exponentially. Yet, a majorization cycle is present in the process, which is rooted in the structure of both the QFT and the quantum state.

Quantum mechanics affects game theory, and game theory can be used to show classical-quantum strategy without entanglement. For certain games, a suitable quantum strategy is able to beat any classical strategy. It is possible to demonstrate design of quantum strategies without entanglement using two simple examples of entanglement-free games: the PQ-game and the card game.

Consider, for example, the penny flipping game PQ PEANY FLIP game. The game is penny flipping, where player P places a penny head up in a box, after which player Q, then player P, and finally player Q again, can choose to flip the coin or not, but without being able to see it. If the coin ends up being head up, player Q wins, otherwise player P wins. The winning (or cheating, depending upon one's perspective) quantum strategy of Q now involves putting the penny into a superposition of head up and down. Since player P is allowed to interchange only up and down he is not able to change that superposition, so Q wins the game by rotating the penny back to its initial state.

Q produces a penny and asks P to place it in a small box, head up. Then Q, followed by P, followed by Q, reaches into box, without looking at the penny, and either flips it over or leaves it as it is. After Q's second turn they open the box and Q wins if the penny is head up.

Q wins every time they play, using the following quantum game gate: $\left. \psi_{fin} \right\rangle = {{\underset{Q\quad{strategy}}{\underset{︸}{H}} \cdot \underset{P\quad{strategy}}{\underset{︸}{\sigma_{x}\left( I_{2} \right)}} \cdot \underset{Q\quad{strategy}}{\underset{︸}{H}}}\underset{{Initial}\quad{state}}{\underset{︸}{\left. 0 \right\rangle}}}$

and the following quantum strategy: Initial state and strategy Player strategy Result of operation 0⟩ $\underset{H}{\overset{Q}{\longrightarrow}}$ $\left. \left. {\frac{1}{\sqrt{2}}\left( {\left. 0 \right\rangle +} \right.1} \right\rangle \right)$ Classical strategy $\underset{\sigma_{x}{({{or}\quad I_{2}})}}{\overset{P}{\longrightarrow}}$ $\left. \left. {{{{\left. \left. {{{\left. {\frac{1}{\sqrt{2}}\left(  \right.1} \right\rangle +}}0} \right\rangle \right)\quad{or}\quad\frac{1}{\sqrt{2}}\left( {0} \right\rangle} +}}1} \right\rangle \right)$ Quantum strategy $\underset{H}{\overset{Q}{\longrightarrow}}$ 0⟩

Here 0 denotes “head” and 1 denotes “tail”, and $\sigma_{x} = {\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \equiv {NOT}}$ implements P's possible action of flipping the penny over. Q's quantum strategy of putting the penny into the equal superposition of “head” and “tail” on his first turn means that whether P flips the penny over or not, it remains in an equal superposition which Q rotates back to “head” by applying the Hadamard transformation H again, since $H = {{H^{- 1}\quad{and}\quad\frac{1}{\sqrt{2}}\left( {\left. 1 \right\rangle + \left. 0 \right\rangle} \right)} = {\frac{1}{\sqrt{2}}{\left( {\left. 0 \right\rangle + \left. 1 \right\rangle} \right).}}}$ After measurement, Q receives the state |0>. The second application of the Hadamard transformation plays the role of constructive interference. So when they open the box, Q always wins without using entanglement.

If Q were restricted to playing classically, i.e., to implementing only σ_(x) or I₂ on his turns, an optimal strategy for both players would be to flip the penny over or not with equal probability on each turn. In this case, Q would win only half the time, so he does substantially better by playing quantum mechanically.

Now, consider the interesting case of a classical-quantum card game without entanglement. In the classical game, one player A can always win with the probability $\frac{2}{3}.$ But if the other player B performs quantum strategy, he can increase his winning probability from $\frac{1}{3}$ to $\frac{1}{2}.$ In this case, B is allowed to apply quantum strategy and the original unfair game turns into a fair and zero-sum game, i.e., the unfair classical game becomes fair in the quantum world. In addition, this strategy does not use entanglement.

The classical model of the card game is explained as follows. A has three cards. The first card has one circle on both sides, the second has one dot on both sides, and the third card has one circle on one side and one dot on the other. In the first step, A puts the three cards into a black box. The cards are randomly placed in the box after A shakes it. Both players cannot see what happens in the box. In the second step, B takes one card from the box without flipping it. Both players can only see the upper side of the card. A wins one coin if the pattern of the down side is the same as that of the upper side and loses one coin when the patterns are different. It follows that A has a $\frac{2}{3}$ probability of winning and B only has a $\frac{1}{3}$ chance of winning. B is in a disadvantageous situation and the game is unfair to him. Any rational player will not play the game with A because the game is unfair. In order to attract B to play with him, before the original second step, A allows B to have one chance to operate on the cards. That is, B has one step query on the box. In the classical world, B can only attain one card information after the query. Because the card is in the box, so what B knows is only one upper side pattern of the three cards. Except for this, he knows nothing about the three cards in the black box. So in the classical field, even having this one step query, B still will be in a disadvantaged state and the game is still unfair.

Now consider the quantized approach to the card game. In the quantum field, the whole game is changed. The game turns into a fair zero-sum game and both players are in equal situation. Consider first the case when A uses the classical strategy and B uses the quantum strategy. In the first step, A puts the cards in the box and shakes the box, that is, he prepares the initial state randomly. The card state is |0> if the pattern in the upper side is circle and |1> if it is dot. So the upper sides of the three cards in the box can be described as |r>=|r₀>|r₁>|r₂>, where r₀, r₁, r₂ ε

0,1

, which means |r₀>, |r₁>, r₂> are all eigenstate superpositions of |0> and |1>.

After the first step of the game, A gives the black box to B. Because A thinks in classical way, in his mind B cannot get information about all upper side patterns of the three cards in the box. So A can still win with higher probability. But what B uses is quantum strategy: He replaces the classical one step query with one step quantum query. The following shows how B queries the box.

Assume that B has a quantum machine that applies an unitary operator U on its three input qubits and gives three output qubits. This machine depends on the state |r> in the box that A gives B. The explicit expression of U and its relation with |r> is as following U=U₀{circle around (×)}U₁{circle around (×)}U₂ where $U_{k} = \left\{ {\begin{matrix} {I_{2} = {{\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}\quad{if}\quad r_{k}} = 0}} \\ {\sigma_{x} = {{\begin{pmatrix} 1 & 0 \\ 0 & {- 1} \end{pmatrix}\quad{if}\quad r_{k}} = 1}} \end{matrix} = {\begin{pmatrix} 1 & 0 \\ 0 & {\exp\left\{ {{\mathbb{i}\pi}\quad r_{k}} \right\}} \end{pmatrix}.}} \right.$

The processing of the query is shown in FIG. 40. After the process, the output state is |ψ_(fin)>=(H{circle around (×)}H{circle around (×H)U(H{circle around (×)}H{circle around (×)}H)|000>=(HU₀H)|0>(HU₁H)|0>(HU₂H)|0>.

Because ${{H\quad U_{k}H} = {{\frac{1}{2}\begin{pmatrix} 1 & 1 \\ 1 & {- 1} \end{pmatrix}\begin{pmatrix} 1 & 0 \\ 0 & {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}} \end{pmatrix}\begin{pmatrix} 1 & 1 \\ 1 & {- 1} \end{pmatrix}} = {\frac{1}{2}{\begin{pmatrix} {1 + {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}}} & {1 - {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}}} \\ {1 - {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}}} & {1 + {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}}} \end{pmatrix}.{So}}}}}\quad$ ${H\quad U_{k}H\left. 0 \right\rangle} = {{{\frac{1 + {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}}}{2}\left. 0 \right\rangle} + {\frac{1 - {\mathbb{e}}^{{\mathbb{i}\pi}\quad r_{k}}}{2}\left. 1 \right\rangle}} = \left\{ {\begin{matrix} {{\left. 0 \right\rangle\quad{if}\quad r_{k}} = 0} \\ {{\left. 1 \right\rangle\quad{if}\quad r_{k}} = 1} \end{matrix} = \left. r_{k} \right\rangle} \right.}$

From the above equation, it follows that B can obtain the complete information about the upper patterns of all the three cards through one query. There are only two possible kinds of output states in the black box, which is |0>|0>|1> or |1>|1>|0>, that is two circles and one dot on the upper side or two dots and one circle. Assume that the state of the cards after the first step is two circles and one dot, i.e., |0>|0>|1>. After the one-step query, B knows the complete information about the upper patterns, but has no individual information about which upper pattern corresponds to which card. Then he takes one card out of the box to see what pattern is on the upper side. If B finds out that he is in a disadvantage situation, the upper pattern of the card is dot (|1>), he refuses to play with A in this turn because he knows the down side is dot definitely. Otherwise if the upper side pattern is circle (|0>), then he knows that the down side pattern is circle |0> or dot |1>. So he continues his turn because the probability of winning is $\frac{1}{2}.$ B will continue the game because he has probability $\frac{1}{2}$ to win. Hence, the game becomes fair and is also zero-sum.

One of the reasons why the quantum strategies in games are better than classical strategies is that the initial state is maximally entangled. The quantum strategy in the card game applied by B includes no entanglement and is still better than the classical strategy.

The initial state input to the quantum machine is |0>|0>|0>, which is separable. After the Hadamard transformation, the state is $\frac{1}{\sqrt{2^{3}}}\left( {\left. 0 \right\rangle +} \right.1{\text{〉)} \otimes \left( {\left. 0 \right\rangle +} \right.}1\text{〉}{\text{)} \otimes {\left( {\left. 0 \right\rangle + \left. 1 \right\rangle} \right).}}$

Performed by U, the state becomes $\frac{1}{\sqrt{2^{3}}}\left( {\left. 0 \right\rangle + {\mathbb{e}}^{{\mathbb{i}}\quad\pi\quad r_{0}}} \right.1\text{〉}{\text{)} \otimes \left( {\left. 0 \right\rangle + {\mathbb{e}}^{{\mathbb{i}}\quad\pi\quad r_{1}}} \right.}1\text{〉}{\text{)} \otimes {\left( {\left. 0 \right\rangle + {{\mathbb{e}}^{{\mathbb{i}}\quad\pi\quad r_{2}}\left. 1 \right\rangle}} \right).}}$ And the states, after the second Hadamard transformation, are in the output state |r₀>r₁>r₂>. The state is described by the tensor products of the states of the individual qubits, so it is unentangled. And because the operators (H and U) are also tensor products of the individual local operators on these qubits, in this quantum game there is no entanglement applied.

Entanglement is important for static games (such as the Prisoner's Dilemma) but may not be necessary in dynamic games (such as the PQ-game and the card game). In static games, each player can only control his qubit and his operation is local. So in the classical world, the operation of one player cannot have influence on others in the operational process. But in the quantum field, through entanglement, the strategy used by one player can influence not only himself, but also his opponents. In dynamic games, players can control all qubits at any step. So, as in QAs, in dynamic games, players can use quantum strategies without entanglement to solve problems, even entangled quantum strategies can be re-described with other quantum strategies without entanglement.

Thus, if B is given a quantum strategy (e.g., a quantum query) against his classical opponent A, the classical opponent cannot always win with high probability. Both players are on equal footing and the game is a fair zero-sum game. The quantum game includes no entanglement and quantum-over-classical strategy is achieved using only interference. Thus, quantum strategy can still be powerful without entanglement.

In general, the PQ game can be described as follows: Definition Main operations (i) A Hilbert space H (the possible states of the game) with N = dim H (ii) An initial state ψ₀ ∈ H (iii) Subset Q_(i) ⊂ U (N), i ∈ {1, . . ., k + 1} - the elements of Q_(i) are the moves Q chooses among on turn i (iv) Subset P_(i) ⊂ S_(N), i ∈ {1, . . ., k}, where S_(N) is the permutation group on N elements - the elements of P_(i) are the moves P chooses among on turn i (v) A projection operator Π on H (the subspace W_(Q) fixed by Π consists of the winning states for Q)

Since only P and Q play, these are two-player games; they are zero-sum since when Q wins, P loses, and vice versa. A pure quantum strategy for Q is a sequence u_(i) ε Q_(i). A pure (classical) strategy for P is a sequence s_(i) ε P_(i), while a mixed (classical) strategy for P is a sequence of probability distributions ƒ_(i):P_(i)→[0,1]. If both Q and P play pure strategies, the corresponding evolution of the PQ-game is described by quantum game gate: ${❘{\psi_{fin}\text{〉}}} = {\prod\limits_{k}\quad{u_{k + 1}s_{k}u_{k}\text{❘}\psi_{i\quad n}{\text{〉}.}}}$

After Q's last move, the state of the game is measured with Π. According to the rules of quantum mechanics, the players observe the eigenvalue 1 with probability Tr(ψ^(†)Πψ); this is lo the probability that the state is projected into W_(Q) and Q wins. More generally, if P plays a mixed strategy, the corresponding evolution of the PQ-game is described by ${\rho_{f} = {{u_{k + 1}\left( {\sum\limits_{s_{k} \in P_{k}}{{f_{k}\left( s_{k} \right)}s_{k}u_{k}\quad\ldots\quad{u_{2}\left( {\sum\limits_{s_{1} \in P_{1}}{{f_{1}\left( s_{1} \right)}s_{1}u_{1}\rho_{0}u_{1}^{\dagger}s_{1}^{\dagger}}} \right)}u_{2}^{\dagger}\quad\ldots\quad u_{k}^{\dagger}s_{k}^{\dagger}}} \right)}u_{k + 1}^{\dagger}}},$ where ρ₀=|ψ₀>{circle around (×)}<ψ₀ ^(†)|. Again, after Q's last move ρ_(ƒ) is measured with Π; the probability that ρ_(ƒ) is projected into W_(Q){circle around (×)}W_(q) ^(†) and Q wins is Tr (Πρ_(ƒ)). 1 5 An equilibrium state is a pair of strategies, one for P and one for Q, such that neither player can improve his probability of winning by changing his strategy while the other does not. In general, unlike the simple case of the PQ-game, W_(Q)=W_(Q)(

s_(i)

) or W_(Q)=W_(Q)(

ƒ_(i)

), i.e., the conditions for Q's win can depend on P's strategy. There are mixed/quantum equilibria at which Q does better than he would at any mixed/mixed equilibrium; there are some QAs, which outperform classical ones. 5.2. Interrelations Between QAs and Quantum Games Structures.

A QA for an oracle problem can be understood as a quantum strategy for a player in a two-player zero-sum game in which the other player is constrained to play classically. This correspondence can be formalized and the following development gives examples of games (and hence, oracle problems) for which the quantum player can do better than that would be possible classically. In the general case, entanglement (or some replacement resource) is required. However, an efficient quantum search of a “sophisticated” database requires no entanglement at any time step. A quantum-over-classical reduction in the number of queries is achieved using only interference, not entanglement, within the usual model of quantum computation. TABLE 5.1 Oracle functions Number Title of oracle Type Definition 1 The phase oracle P_(f) $\left. {{\left. {{\left. \left. {{\left. {x} \right\rangle }b} \right\rangle\rightarrow{\exp\quad\left\{ \frac{2\pi\quad{{{if}{\quad\quad}(x)} \cdot b}}{2^{n}} \right\}} \right.}x} \right\rangle }b} \right\rangle$ 2 The standard oracle S_(f) x⟩b⟩→x⟩b ⊕ f  (x)⟩ 3 The minimal (an erasing) oracle M_(f) x⟩→f  (x)⟩

Returning to the quantum oracle evaluation of multi-valued Boolean functions discussed in section 3, consider a multi-valued function F that is one-to-one and where the size of its domain and range is the same. The problem can be formulated as follows: Given an oracle ƒ(a, x):

0,1

^(n)×

0,1

^(n)→

0,1

and a fixed (but hidden) value a₀, obtain the value of a₀ by querying the oracle ƒ(a₀, x). The algorithm evaluates the multi-valued Boolean function F through oracle calls and the main goal is to minimize the number of such oracle calls (the query complexity) using a quantum mechanism.

Query complexity is one of the issues in quantum computation, especially in proving lower bounds of QAs with oracles. Generally speaking, there are two popular techniques to derive quantum lower bounds: (i) polynomials; and (ii) adversary methods. For the bounded error case, evaluations of AND and OR functions need Θ(√{overscore (N)}) number of queries, while parity and majority functions at least $\frac{N}{2}$ and Θ(N), respectively. Alternatively, define ${F\left( {x_{0},\quad\ldots\quad,x_{N - 1}} \right)} = \left\{ \begin{matrix} {{a\quad{if}\quad x_{a}} = {{1\quad{and}\quad x_{j}} = {{0\quad{for}\quad{all}{\quad\quad}j} \neq a}}} \\ {{undefined}\quad{otherwise}} \end{matrix} \right.$ then evaluating this function F is the same as Grover's QSA. Moreover, if one defines ${F\left( {x_{0},\quad\ldots\quad,x_{N - 1}} \right)} = \left\{ \begin{matrix} {{a\quad{if}\quad x_{a}} = {{{a \cdot {i\left( {{mod}\quad 2} \right)}}\quad{for}\quad{all}{\quad\quad}0} \leq i \leq {N - 1}}} \\ {{undefined}\quad{otherwise}} \end{matrix} \right.$ then this is the same as the so-called Bernstein-Varzirani problem. Some lower bounds are easier to obtain using the quantum adversary method than the polynomials one. The lower bound of a bounded-error quantum query complexity of read-once functions is Ω(√{overscore (N)}).

Quantum evaluation assumes that it is possible to obtain the value of variable x_(i) only through an oracle O (i). Since both functions are one-to-one, and their domain and range are of the same size, it is possible to formulate the problem as follows.

Let n be an integer ≧1 and N=2^(n). Then, given an oracle defined as a function ƒ(a,x):

0,1

^(n)×

0,1

^(n)→

0,1

such that ƒ(a,x)≠ƒ(a₂,x) for some x if a₁≠a₂, and a fixed (and hidden) value a, it is desired to obtain the value a, using the oracle ƒ(a, x).

For the Grover QSA, the definition ${f\left( {x,a} \right)} = \left\{ {{\begin{matrix} 1 \\ {0,} \end{matrix}\begin{matrix} {{{if}\quad x} = a} \\ {otherwise} \end{matrix}},} \right.$ completely specifies the problem. This oracle is sometimes called the exactly quantum (EQ) oracle and is denoted by EQ_(a)(x). Table 5.2 shows the case ƒ(x, a)=EQ_(a)(x) for n=4.

As can be seen from Table 5.2, ƒ(a, x) is given by a truth-table of size N×N, where each row gives the function F of the previous definition. For example, F (1, 0, . . . , 0)=0000 from the first row of the Table 5.2. If the hidden value a is 0010 for example, the oracle returns value 1 only when it is queried with x=0010 .

For the Bernstein-Vazirani problem, the similar definition is given as ƒ(a, x)=a·x(mod 2),

which is called the inner product (IP) oracle and denoted by IP_(a) (x). Its truth-table for n=4 is given in Table 5.3. TABLE 5.2 x a $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 1 & 1 & 1 & 1 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 1 & 1 & 1 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 1 \end{matrix}\quad$ $\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \equiv I$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 1 & 1 & 0 \\ 0 & 1 & 1 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \equiv I$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 1 \\ 1 & 0 & 1 & 0 \\ 1 & 0 & 1 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \equiv I$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 1 \\ 1 & 1 & 1 & 0 \\ 1 & 1 & 1 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\quad$ $\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \equiv I$

The above assumed that the domain of the Boolean function has the same size as its range. More general cases, e.g., the size of the range is larger than the domain, will be mentioned briefly below.

The quantum query complexity is a function of the number of oracle calls needed to obtain the hidden value a. The query complexity for the EQ-oracle is Θ(√{overscore (N)}), while only O(1) for the IP-oracle. A difference exist between the EQ- and IP-oracles. The difference can be shown by comparing their truth-tables given in Tables 5.21 and 5.32, where Table 5.3 shows a truth-table for ${{f\left( {x,a} \right)} = {{IP}_{a} = \left\{ {{a \cdot x} = {\sum\limits_{i}{a_{i} \cdot {x_{i}\left( {{mod}\quad 2} \right)}}}} \right\}}},{n = 4.}$

One can immediately see TABLE 5.3 x a $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 1 & 1 & 1 & 1 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 1 & 1 & 1 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}\quad$ $\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 1 \end{matrix}\quad$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{matrix} 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 1 & 1 & 0 \\ 0 & 1 & 1 & 1 \end{matrix}\quad$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 1 & {\quad 1} & {\quad 1\quad} & 1 \\ 1 & {\quad 0} & \left. 1 \right\rceil & 0 \\ 1 & \left\lfloor 1 \right. & {\quad 0\quad} & 0 \\ 1 & {\quad 0} & {\quad 0\quad} & 1 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 1 & {\quad 1} & {\quad 1\quad} & 1 \\ 1 & {\quad 0} & \left. 1 \right\rceil & 0 \\ 1 & \left\lfloor 1 \right. & {\quad 0\quad} & 0 \\ 1 & {\quad 0} & {\quad 0\quad} & 1 \end{pmatrix}{\quad\quad}$ $\begin{matrix} 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 1 \\ 1 & 0 & 1 & 0 \\ 1 & 0 & 1 & 1 \end{matrix}\quad$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 1 & {\quad 1} & {\quad 1\quad} & 1 \\ 1 & {\quad 0} & \left. 1 \right\rceil & 0 \\ 1 & \left\lfloor 1 \right. & {\quad 0\quad} & 0 \\ 1 & {\quad 0} & {\quad 0\quad} & 1 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 1 & {\quad 1} & {\quad 1\quad} & 1 \\ 1 & {\quad 0} & \left. 1 \right\rceil & 0 \\ 1 & \left\lfloor 1 \right. & {\quad 0\quad} & 0 \\ 1 & {\quad 0} & {\quad 0\quad} & 1 \end{pmatrix}{\quad\quad}$ $\begin{matrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 1 \\ 1 & 1 & 1 & 0 \\ 1 & 1 & 1 & 1 \end{matrix}\quad$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 1 & {\quad 1} & {\quad 1\quad} & 1 \\ 1 & {\quad 0} & \left. 1 \right\rceil & 0 \\ 1 & \left\lfloor 1 \right. & {\quad 0\quad} & 0 \\ 1 & {\quad 0} & {\quad 0\quad} & 1 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 1 & {\quad 1} & {\quad 1\quad} & 1 \\ 1 & {\quad 0} & \left. 1 \right\rceil & 0 \\ 1 & \left\lfloor 1 \right. & {\quad 0\quad} & 0 \\ 1 & {\quad 0} & {\quad 0\quad} & 1 \end{pmatrix}{\quad\quad}$ $\begin{pmatrix} 0 & {\quad 0} & {0\quad} & 0 \\ 0 & \left\lceil 1 \right. & {0\quad} & 1 \\ 0 & {\quad 0} & \left. 1 \right\rfloor & 1 \\ 0 & {\quad 1} & {1\quad} & 0 \end{pmatrix}{\quad\quad}$

The table for IP_(a) is well-balanced in terms of the numbers of 0's and 1's, but quite unbalanced for EQ_(a). The natural consequence is that there should be intermediate oracles between those extreme cases for which the query complexity is also intermediate between Θ(√{overscore (N)}) and O(1). Furthermore, these intermediate oracles can be characterized by some parameter in such a way that the query complexity depends upon this parameter value and both EQ_(a) and IP_(a) are obtained as special cases.

For these two oracles, the EQ-oracle (defined as f (a, x)=1 iff x=a) and the IP-oracle (defined as ƒ(a,x)=a·x mod2 ), the query complexity is Θ(√{overscore (N)}) for the EQ-oracle while only O(1) for the IP-oracle. To investigate what causes this large difference, the parameter K can be introduced as the maximum number of 1's in a single column of T_(ƒ) where T_(ƒ) is the N×N truth-table of the oracle ƒ(a, x). The quantum complexity is strongly related to this parameter K.

To develop models and estimation of quantum lower/upper bounds, let T_(ƒ) be the truth-table of an oracle ƒ(a,x) like the oracles given in Tables 5.2 and 5.3. Assume without loss of generality that the number of 1's is less than or equal to the number of 0's in each column of T_(ƒ). Let #_(i)(T_(ƒ)) denote the number of 1's $\left( {\leq \frac{N}{2}} \right)$ in the i-th column of T_(ƒ) and #(T_(ƒ))=max₁ #_(i)(T_(ƒ)). This single parameter #(T_(ƒ)) plays a key role, namely: (i) Let ƒ(a, x) be any oracle and K=#(T_(ƒ)). Then the query complexity of the search problem for ƒ(a,x) is ${\Omega\left( \sqrt{\frac{N}{K}} \right)};$ This lower bound is tight in the sense that it is possible to construct an explicit oracle whose query complexity is ${O\left( \sqrt{\frac{N}{K}} \right)}.$ This oracle again includes both EQ and IP oracles as special cases; (iii) The tight complexity, ${\Theta\left( {\frac{N}{K} + {\log\quad K}} \right)},$ is also obtained for the classical case. Thus, the QA needs a quadratically fewer number of oracle calls when K is small and this merit become larger when K is large, e.g., log K versus a constant when K=cN.

The quantum oracle models and reduction of query number problems frame the context for the discussion for the database search problem, that is, to identify a specific record in a large database. Formally, records are labeled

0,1, . . . , N−1

where, for convenience when writing the numbers in binary, it is convenient to take N=2^(n) where n is a positive integer. In one embodiment, a quantum database search involves a database in which, when queried about a specific number, the oracle responds only that the guess is correct or not. On a classical reversible computer, one can implement a query by a pair of register (x,b), where x is an n-bit string representing the guess, and b is a single bit which the database will use to respond to the query. If the guess is correct, the database responds by adding 1(mod2) to b ; if it is incorrect, it adds 0 to b. That is, the response of the database is the operation: |x>|b>→|x>|b⊕ƒ_(a)(x)>, where ƒ_(a)(x)=1 when x=a, 0 otherwise. Thus, if b changes, one knows that the guess is correct. Classically, it takes N−1 queries to solve this problem with probability 1.

The following oracles are defined in Table 5.4 for a general function ƒ:

0,l

^(m) →

0,1

^(n). Here x and b are strings of m and n bits respectively, |x> and |b> the corresponding computational basis states, and ⊕ is addition modulo 2^(n). The oracles P_(ƒ) and S_(ƒ) are equivalent in power: each can be constructed by a quantum circuit containing just one copy of the other. Assuming m=n and assuming ƒ is a known permutation on the set

0,1

^(n) then M_(ƒ) is a simple invertible quantum map associated to ƒ. Intuitively, erasing oracles seem at least as strong as standard ones, though it is not clear how to simulate the latter with the former without also having access to an oracle that map |x> to |ƒ⁻¹(x)>. One-way functions provide a clue: if ƒ is one-way, then (by assumption) |x>|ƒ(x)> can be computed efficiently, but if |ƒ(x)> could be computed efficiently given |x> then so could |x> given |ƒ(x)>, and hence ƒ could be inverted. For some problems, an exponential gap between query complexity given a standard oracle and query complexity given an erasing oracle.

QAs work by supposing that they will be realized in a quantum system, which can be in a superposition of “classical” states. These states form a basis for the Hilbert space whose elements represent states of the quantum system. More generally, Grover's QSA works with quantum queries which are linear combinations Σc_(x,b)|x,b>, where c_(x,b) are complex numbers satisfying Σ|c_(x,b)|²=1. The operations in QAs are unitary transformations, the quantum mechanical generalization of reversible classical operations. Thus, the operation of the database that Grover considered is implemented on superpositions of queries by a unitary transformation, which takes |x,b> to |x>|b⊕ƒ_(a)(x)>. By using $\left\lfloor {\frac{\pi}{4}\sqrt{N}} \right\rfloor$ quantum queries, it identifies the answer with probability close to 1: The final vectors for the N possible answers a are nearly orthogonal.

Consider one of the guessing game type that uses Grover's QSA for guessing of any number between 0 and N−1 and to discuss the role of different quantum oracle models in the reduction of query number. Assume, in PQ-game, the player Q boasts that if P picks any number between 0 and N−1, inclusive, he can guess it. P knows the Grover's QSA and realizes that for N=2^(n), the player Q can determine the number he picks with high probability by playing the following strategy: TABLE 5.4 0…0, 0⟩ $\underset{{H^{\otimes n} \otimes H}\quad\sigma_{x}}{\overset{Q}{\longrightarrow}}$ $\left. \left. {{\left. {{{\frac{1}{\sqrt{N}}\underset{x\quad 00}{\overset{n - 1}{\quad\sum}}}\quad }x} \right\rangle \otimes \frac{1}{\sqrt{2}}}\left( {\left. 0 \right\rangle -} \right.1} \right\rangle \right)$ ⇒ (u₁) $\underset{s{(f_{a})}}{\overset{P}{\longrightarrow}}$ $\begin{matrix} {{\frac{1}{\sqrt{N}}\underset{x = 0}{\overset{n - 1}{\quad\sum}}\left( {- 1} \right)^{\delta_{xa}}}\quad} \\ \left. \left. {{\left. {{\quad }x} \right\rangle \otimes \frac{1}{\sqrt{2}}}\left( {\left. 0 \right\rangle -} \right.1} \right\rangle \right) \end{matrix}\quad$ ⇒ (s₁) $\underset{H^{\otimes n} \otimes {I_{2} \circ {s{(f_{0})}} \circ H^{\otimes n}} \otimes I_{2}}{\overset{Q}{\longrightarrow}}$ . . . , ⇒ (u₂) using the following quantum game gate: G=[H ^({circle around (×)}n) {circle around (×)}I ₂ ∘s(ƒ₀)∘H ^({circle around (×)}n) {circle around (×)}I ₂ ]∘s(ƒ_(a))∘[H ^({circle around (×)}n) {circle around (×)}Hσ _(x)] which can be efficiently simulated using classical computer. Where a ε[0,N−1] is P's chosen number, moves (s₁) and (u₂) are repeated a total of $k = \left\lfloor {\frac{\pi}{4}\sqrt{N}} \right\rfloor$ times, i.e., (s_(k)= . . . =s₁) and (u_(k)= . . . =u₂). For ƒ:Z₂ ^(n)→Z₂, the oracle s(ƒ) is the permutation (and hence unitary transformation) defined by (see Table 5.4) s(ƒ)|x,b>=|x,b⊕ƒ(x)>. Each P's moves s_(i) can be thought of as the response of an oracle, which computes ƒ_(x)(x):=δ_(xa) to respond to the quantum query defined by the state after the action of quantum strategy (u_(i)). After O(√{overscore (N)}) such queries, a measurement by Π=|a><a|{circle around (×)}I₂ returns a win for Q with probability bounded above $\frac{1}{2},$ i.e., Grover's QSA determines a with high probability.

If Q were to play classically, he could query P about a specific number at each time, but on the average it would take $\frac{N}{2}$ turns to guess a. A classical equilibrium is for P to choose a random, and for Q to choose a permutation of N=2^(n) uniformly at random and guess numbers in the corresponding order. Even when P plays such a mixed strategy, Q's quantum strategy is optimal; together they define a mixed / quantum equilibrium.

Knowing all this, P responds that he will play, but that Q should only get one guess, not $k = {\left\lfloor {\frac{\pi}{4}\sqrt{N}} \right\rfloor.}$ Q protests that this is hardly fair, but he will play, as long as P tells how close his guess is to the chosen number. P agrees, and they play. Q wins every step.

In this case, Q uses a slightly improved Berstein-Vazirani algorithm: Guess x and answer a are vectors in Z₂ ^(n), so x·a depends on the cosine of the angle between these vectors. Thus, it seems reasonable to define the oracle “how close a guess is to the answer” to be the oracle response ƒ_(a)(x)

g_(a)(x):=x·a. Then Q plays as follows: 0…0, 0⟩ $\underset{{H^{\otimes n} \otimes H}\quad\sigma_{x}}{\overset{Q}{\longrightarrow}}$ $\left. \left. {{\left. {{{\frac{1}{\sqrt{N}}\underset{x\quad 00}{\overset{n - 1}{\quad\sum}}}\quad }x} \right\rangle \otimes \frac{1}{\sqrt{2}}}\left( {\left. 0 \right\rangle -} \right.1} \right\rangle \right)$ ⇒ (u₁) $\underset{s{(g_{a})}}{\overset{P}{\longrightarrow}}$ $\left. \left. {{\left. {{{\frac{1}{\sqrt{N}}\underset{x = 0}{\overset{n - 1}{\quad\sum}}\left( {- 1} \right)^{x \cdot a}}\quad }x} \right\rangle \otimes \frac{1}{\sqrt{2}}}\left( {\left. 0 \right\rangle -} \right.1} \right\rangle \right)$ ⇒ (s₁) $\underset{H^{\otimes n} \otimes I_{2}}{\overset{Q}{\longrightarrow}}$ $\left. \left. {{\left. {a} \right\rangle \otimes \frac{1}{\sqrt{2}}}\left( {\left. 0 \right\rangle -} \right.1} \right\rangle \right)$ ⇒ (u₂) using the following (more simple) quantum game gate: G=[H^(e,crc ×)n{circle around (×)}I₂]∘g_(a)(x)∘[H^({circle around (×)}n){circle around (×)}Hσ_(x)]. For Π=|a><a|{circle around (×)}I₂ again, Q wins with probability 1, having queried P only once.

The oracle, which responds in the Berstein-Vazirani algorithm with x·a (mod2), is a “sophisticated database” by comparison with Grover's oracle in QSA, which only responds that a guess is correct or incorrect. And finally, entanglement is not required in the Berstein-Vazirani QA for quantum-over-classical improvement. The improved version of the Berstein-Vazirani algorithm does not create entanglement at any time step, but still solves this oracle problem with fewer queries than is possible classically.

Quantum computing manipulates quantum information by means of unitary transformations, such as superpositions. For instance, a single-qubit Walsh-Hadamard operation H transforms a qubit from |0> to |+> and from |1> to |−>. When H is applied to a superposition such as |+>, it follows by the linearity of quantum mechanics that the resulting state is ½

(|0>+|1>)+(|0>−|1>)

=0. This illustrates the phenomenon of destructive interference, by which component |1> of the state is erased. Consider now an n-qubit quantum register initialized to |0^(n)>. Applying a Walsh-Hadamard transform to each of these qubits yields an equal superposition of all n-bit classical states: $\left. 0^{n} \right\rangle\overset{H}{\rightarrow}{\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x = 0}^{2^{n} - 1}{\left. x \right\rangle.}}}$

Consider now a function ƒ:

0,1

^(n)→

0,1

, that maps n-bit strings to a single bit. On a quantum computer, because unitary transformations are reversible, it is natural to implement it as a unitary transformation U_(ƒ) that maps |x>|b> to |x>|b⊕ƒ(x)>, where x is an n-bit string, b is a single bit, and “⊕” denotes the Exclusive −OR (XOR). Schematically, ${\left. x \right\rangle\left. b \right\rangle}\overset{U_{f}}{\rightarrow}{\left. x \right\rangle{\left. {b \oplus {f(x)}} \right\rangle.}}$

Quantum computers can solve some problems exponentially faster than any classical computer provided the input is given as an oracle, even if bounded errors are allowed. In this model, some function ƒ:

0,1

^(n)→

0,1

is given as a black-box, which means that the only way to obtain knowledge about ƒ is to query the black-box on chosen inputs. In the corresponding quantum oracle model, a function ƒ is provided by a black-box that applies unitary transformation U_(ƒ) to any chosen quantum state, as described by: ${\left. x \right\rangle\left. b \right\rangle}\overset{U_{f}}{\rightarrow}{\left. x \right\rangle{\left. {b \oplus {f(x)}} \right\rangle.}}$

The goal of the algorithm is to learn some property of the function ƒ.

The linearity of quantum mechanics gives rise to quantum parallelism and two important phenomena, the first of which is quantum parallelism. It is possible to compute ƒ on arbitrarily many classical inputs by a single application of U_(ƒ) to a suitable superposition: ${\sum\limits_{x}{\alpha_{x}\left. x \right\rangle\left. b \right\rangle}}\overset{U_{f}}{\rightarrow}{\sum\limits_{x}{\alpha_{x}\left. x \right\rangle{\left. {{f(x)} \oplus b} \right\rangle.}}}$

When this is done, the additional output qubit may become entangled with the input register;

The second phenomena is phase kick-back: The outcome of ƒ can be recorded in the phase of the input register rather than being XOR-ed to the additional output qubit: ${{\left. x \right\rangle\left.  - \right\rangle}\overset{U_{f}}{\rightarrow}{\left( {- 1} \right)^{f{(x)}}\left. x \right\rangle\left.  - \right\rangle}};$ ${\sum\limits_{x}{\alpha_{x}\left. x \right\rangle\left.  - \right\rangle}}\overset{U_{f}}{\rightarrow}{\sum\limits_{x}{{\alpha_{x}\left( {- 1} \right)}^{f{(x)}}\left. x \right\rangle{\left.  - \right\rangle.}}}$

The fundamental questions in quantum computing are following:

The common measure of efficiency for computer algorithms is the amount of time required to obtain the solution as function of the input size. In the oracle context, this usually means the number of queries needed to gain a predefined amount of information about the solution. In contrast, one can fix a maximum number of oracle calls and to try to obtain as much Shannon information as possible about the correct answer. In this model, when a single oracle query is performed, the probability of obtaining the correct answer is better for the QA than for the optimal classical algorithm, and the information gained by that single query is higher. This is true even when no entanglement is ever present throughout the quantum computation and even when the state of the quantum computer is arbitrarily close to being totally mixed. QAs can be better than classical algorithms even when the state of the computer is almost totally mixed, which means that it contains an arbitrary small amount of information. It means that QAs can be better than classical algorithms even when no entanglement is present.

It is often believed that entanglement is essential for quantum computing. However, in many cases, quantum computing without entanglement is better than anything classically achievable, in terms the reliability of the outcome after a fixed number of oracle calls. It means that: (i) entanglement is not essential for all QAs; and (ii) some advantage of QAs over classical algorithms persists even when the quantum state contains an arbitrary small amount of information—that is, even when the state is arbitrarily close to being totally mixed.

A special quantum state known as a pseudo-pure state (PPS) can be used to describe entanglement-free quantum computation. PPS occurs naturally in the framework of Nuclear Magnetic Resonance (NMR) quantum computing. Consider any pure state |ψ> on n-qubits and some real number 0≦ε≦1. PPS has the following form: ρ_(PPS) ^(n)≡ε|ψ><ψ|+(1−ε)I.

It is a mixture of a pure state |ψ> with the totally mixed state $I = {\frac{1}{2^{n}}I_{2^{n}}}$ (where I₂ _(n) denotes the identity matrix of order 2^(n)). For example, the Werner state is a special case of PPS.

To understand why these states are called pseudo-pure, consider what happens if a unitary operation U is performed on state ρ=ρ_(PPS) ^(n).

First, the purity parameter ε of the PPS is conserved under a unitary transformation, since $\rho\overset{U}{\rightarrow}{U\quad\rho\quad U^{\dagger}}$ and UI U^(†)=I , and UρU^(†)=εU|ψ><ψ|U^(†)+(1−ε)UI U^(†=ε|φ><φ|+()1−ε)I, where |φ>=U|ψ>. In other words, unitary operations affect only the pure part of these states, leaving the totally mixed part unchanged and leaving the pure proportion ε intact.

For a PPS there exists some bias ε below which these states are never entangled. Thus, for any number n of qubits, a state ρ_(PPS) ^(n) is separable whenever ${ɛ < \frac{1}{1 + 2^{{2n} - 1}}},$ regardless of its pure part |ψ>.

Consider the density matrix ρ_(PPS) ^(n)≡ε|ψ><ψ|+(1−ε)I. Its candidate ensemble probability satisfies ${w\left( {{\overset{\rightarrow}{n}}_{1},\ldots\quad,{\overset{\rightarrow}{n}}_{N}} \right)} = {{\frac{1 - ɛ}{\left( {4\quad\pi} \right)^{N}} + {ɛ\quad{w\left( {{\overset{\rightarrow}{n}}_{1},\ldots\quad,{\overset{\rightarrow}{n}}_{N}} \right)}}} \geq {\frac{1 - {ɛ\left( {1 + 2^{{2N} - 1}} \right)}}{\left( {4\quad\pi} \right)^{N}}.}}$

Therefore, ρ_(ε) is separable if $ɛ \leq \frac{1}{1 + 2^{{2N} - 1}}\underset{N\rightarrow\infty}{\approx}{\frac{2}{4^{N}}.}$

Here again, the density matrices in the neighborhood of the maximally mixed matrices are separable, and one obtains a lower bound on the size of the separable neighborhood. For N≧4 the bound is better than the bound $ɛ \leq {\frac{1}{\left( {1 + 2^{N - 1}} \right)^{N - 1}}.}$

One illustrative example is the Greenberger-Horne-Zeilinger (GHZ) state, a state of three qubits with density matrix $\begin{matrix} {\rho_{GHZ} = {\frac{1}{2}\left( {\left. 111 \right\rangle + \left. 222 \right\rangle} \right)\left( \left\langle {111{{+ \left\langle {{222\left.  \right)} =} \right.}}} \right. \right.}} \\ {= {\frac{1}{8}\left( {{1_{2} \otimes 1_{2} \otimes 1_{2}} + {1_{2} \otimes \sigma_{3} \otimes \sigma_{3}} + {\sigma_{3} \otimes 1_{2} \otimes \sigma_{3}} +} \right.}} \\ {{\sigma_{3} \otimes \sigma_{3} \otimes 1_{2}} + {\sigma_{1} \otimes \sigma_{1} \otimes \sigma_{1}} - {\sigma_{1} \otimes \sigma_{2} \otimes \sigma_{2}} -} \\ {{{\sigma_{2} \otimes \sigma_{1} \otimes \sigma_{2}} - {\sigma_{2} \otimes \sigma_{2} \otimes \sigma_{1}}},} \end{matrix}$ which gives a representation $\begin{matrix} {{w_{GHZ}\left( {{\overset{\rightarrow}{n}}_{1},\ldots\quad,{\overset{\rightarrow}{n}}_{N}} \right)} = {\frac{1}{\left( {4\quad\pi} \right)^{3}}\left\lbrack {1 + {9\left( {{c_{\quad 1}c_{\quad 2}} + {c_{\quad 2}c_{\quad 3}} + {c_{\quad 1}c_{\quad 3}}} \right)} +} \right.}} \\ {{27s_{1}s_{2}s_{3}\cos\quad\left( {\varphi_{1} + \varphi_{2} + \varphi_{3}} \right\rbrack} \geq {- {\frac{26}{\left( {4\quad\pi} \right)^{3}}.}}} \end{matrix}$

Here c_(j)≡cos θ_(j) and s_(j)≡sin θ_(j), and the minimum occurs at θ₁=θ₂=θ₃=π/2 and φ₁+φ₂+φ₃=π. Thus, the mixed state ρ_(ε)=(1−ε)M₈+ερ_(GHZ) is separable if ε≦1/27, in which case, no measurement can reveal evidence of quantum entanglement.

Up to this point it has been assumed that the number of qubits is being fixed, and the boundary between separability and non-separability has been described as the amount of noise, specified by ε, changes. Now, the discussion shifts to thinking of the qubits as particles with spin and asking what happens as the number of particles or their dimension changes, while ε is held fixed. In general, going to more particles or higher spins, allows the system to tolerate more mixing with the maximally mixed state and still have states that are not separable. In other words, for a given ε, one can find states of sufficiently large numbers of particles or sufficiently high spin for which ρ_(ε) is non-separable. This yields an upper bound on the size of separable neighborhood around the maximally mixed state.

Consider now two spin-(d-1)/2 particles, each living in a d-dimensional Hilbert space. Each of these particles is an aggregate of N/2 spin-1/2 particles (qubits), in which case d=2^(N/2). Consider a specific joint density matrix of the two particles, ρ_(ε)=(1−ε)M _(d) ₂ +ε|ψ

ψ|, where |ψ

is a maximally entangled state of the two particles, $\left. \psi \right\rangle = {\frac{1}{\sqrt{d}}{\left( {{\left. 1 \right\rangle\left. 1 \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle} + \ldots + {\left. d \right\rangle\left. d \right\rangle}} \right).}}$

Now project each particle onto the subspace spanned by 1

and |2

. The state after projection is $\begin{matrix} {\overset{\sim}{\rho} = {\frac{1}{A}\left( {{\frac{1 - ɛ}{d^{2}}1_{4}} + {\frac{ɛ}{d}\left( {\left. 1 \right\rangle{1}} \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle}} \right)\left( \left\langle {1{\left\langle {1{{+ \left\langle {2{\left\langle {2\left.  \right)} \right)}} \right.}}} \right.}} \right. \right.}} \\ {= {{\left( {1 - ɛ^{\prime}} \right)M_{4}} + {ɛ^{\prime}\left. \phi \right\rangle\left\langle {\phi{,}} \right.}}} \end{matrix}$ where $A = {\frac{4}{d^{2}}\left\lbrack {1 + {ɛ\left( {\frac{d}{2} - 1} \right)}} \right\rbrack}$ is the normalization factor, $\left. \phi \right\rangle = {\frac{1}{\sqrt{2}}\left( {{\left. 1 \right\rangle\left. 1 \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle}} \right)}$ is a maximally entangle state of two qubits, and $ɛ^{\prime} = {\frac{2\quad{ɛ/d}}{A} = {\frac{ɛ\quad{d/2}}{1 + {ɛ\quad\left( {{d/2} - 1} \right)}}.}}$

The projected state {tilde over (p)} is a Werner state, a mixture of the maximally mixed state for two qubits, M₄, and the maximally entangled state |φ

. The proportion ε′ of maximally entangled state increases linearly with d. Thus, as d increases for fixed ε, there is a critical dimension beyond which p becomes entangled. Indeed, the Werner state is non-separable for ε′>⅓ which is equivalent to d>ε⁻¹−1. Moreover, since the local projections on the two particles cannot create entanglement from a separable state, one can conclude that the state (14) of N qubits is non-separable under the same conditions, i.e., if ${ɛ > \frac{1}{1 + d}} = {\frac{1}{1 + 2^{N/2}}.}$

This result establishes an upper bound, scaling as 2^(−N/2) on the size of the separable neighborhood around the maximally mixed state. The general effect of noise on the computation, then the relationship between separability and noise is disclosed below.

Consider a pure-state computational protocol in which the computer starts in the state |ψ₀

and ends in the state |ψ_(ƒ=U|ψ) ₀

, where U is the unitary time evolution operator which describes the computation. The corresponding computation starting with pseudo-pure state ρ=(1−ε)M+ε|ψ ₀

ψ₀| ends up in the state ρ=(1−ε)M+ε|ψ _(ƒ)

ψ_(ƒ)|.

Upon reaching the final state, a measurement is carried out and the result of the computation is inferred from the result of the measurement.

In the most favorable case, that the pure-state protocol gives the correct answer with certainty with a single repetition of the protocol and that if the result of computation is found, one can check it with polynomial overhead. The Pseudo Pure State (PPS) protocol uses the order of 1/ε repetitions. Thus, if ε becomes exponentially small with N. the number governing the scaling of the classical problem (in other words, the noise becomes exponentially large with N), the protocol requires an exponential number of repetitions to get the correct answer. So, for this amount of noise, the quantum protocol with a PPS cannot transform an exponential problem into a polynomial one: even in the best possible case that the pure-state protocol takes one computational step, the protocol with noise takes exponentially many steps. This conclusion applies quite generally to pseudo-state quantum computing and is independent of the discussion of separability, which follows later.

In the PPS there is a probability ε of finding the computer in the “correct” final state |ψ_(ƒ)

arising from the term ε|ψ_(η)

|ψ_(ƒ)

. As stated above, assume here the most favorable case, that if the state is |ψ_(ƒ then, from the outcome of the final measurement, one can infer the solution to the computational problem with certainty with one repetition. In general protocols, such as Shor's algorithm, for example, a single repetition of the protocol is not sufficient to find the correct answer.)

There is also the probability (1−ε) of finding the computer in the maximally mixed state M. In this case, there is a possibility that the correct answer will be found, since the noise term contains all possible outcomes with some probability. However, the probability of finding the correct answer from the noise term must be at least exponentially small with N. Otherwise, there would be no need to prepare the computer at all: one could find the correct answer from the noise term simply by repeating the computation a polynomial number of times. In fact, if the probability of finding the correct answer from the noise term did not become exponentially small with N, one could dispense with the computer altogether. For using a classical probabilistic protocol, which selected from all the possibilities at random, one would get the correct answer with probability of the order of one with only a polynomial number of trials.

Thus, the probability of finding the correct answer from the pseudo-pure state is essentially ε and so the computation must be repeated 1/ε times on average to find the correct answer with probability of order one.

Now consider whether reaching entangled states during the computation is a necessary condition for exponential speed-up. This is addressed by investigating what can be achieved with separable states. Specifically, impose the condition that the pseudo-pure state remains separable during the entire computation. For an important class of computational protocols, it is shown that this condition implies an exponential amount of noise.

The example protocols shown herein use n=n₁+n₂ qubits of which n₁ are considered to be the input registers, and the remaining n₂ are the output registers. Assume that n₁ and n₂ are polynomial in the number N which describes how the classical problem scales. As stated earlier, the problems in which the quantum protocol gives an exponential speed-up over the classical protocol is to be considered, specifically the classical protocol is exponential in N whereas, the quantum protocol is polynomial in N. (For example, in the factorization problem, the aim is to factor a number of the order of 2^(N). The classical protocol is exponential in N and, in Shor's algorithm, n₁ and n₂ are linear in N.)

In describing the protocols as applied to pure states, the first steps are as follows:

Prepare the system in the initial state: |ψ₀

=|00 . . . 0

⊕|00 . . . 0

Perform a Hadamard transform on the input register, so that the state becomes $\left. \psi_{1} \right\rangle = {\frac{1}{2^{n_{1}/2}}{\sum\limits_{x = 0}^{2^{n_{1}} - 1}{\left. x \right\rangle \otimes \left. {00\quad\ldots\quad 0} \right\rangle}}}$

Evaluate the function ƒ:

0,1

^(n) ¹ →

0, 1

^(n) ² . The state becomes $\left. \psi_{2} \right\rangle = {\frac{1}{2^{n_{1}/2}}{\sum\limits_{x = 0}^{2^{n_{1}} - 1}{\left. x \right\rangle \otimes {\left. {f(x)} \right\rangle.}}}}$

Now consider the protocol when applied to a mixed state input. Thus, the initial state ρ₀ is ρ=(1−ε)M ₂ _(n) +ε|ψ₀

ψ₀|, where M₂ _(n) is the maximally mixed state in the 2^(n) dimensional Hilbert space. After the second computational step the state is ρ=(1−ε)M ₂ _(n) +ε|ψ₂

ψ₂|.

Consider now protocols in which the function ƒ(x) is not constant. Let x₁ and x₂ values of x such that ƒ(x₁)≠ƒ(x₂). Thus the state |ψ₂

can be written as ${\left. \psi_{2} \right\rangle = {\frac{1}{2^{n_{1}/2}}\left\{ {{\left. x_{1} \right\rangle\left. {f\left( x_{1} \right)} \right\rangle} + {\left. x_{2} \right\rangle\left. {f\left( x_{2} \right)} \right\rangle} + \left. \psi_{r} \right\rangle} \right\}}},$ where |ψ_(r)

has no components in the subspace spanned by |x₁

|ƒ(x₁)

, |x₁

|ƒ(x₂)

, |x₂

|ƒf(x₁)

, |x₂

|ƒ(x₂)

. It is convenient to relabel these states and write ${\left. \psi_{2} \right\rangle = {\frac{1}{2^{n_{1}/2}}\left\{ {{\left. 1 \right\rangle\left. 1 \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle} + \left. \psi_{r} \right\rangle} \right\}}},$ where |ψ_(r)

has no components in the subspace spanned by |1

|1

, |1

, |2

, |2

|1

, |2

|1

.

A necessary condition on ε for the state of the system to be separable throughout the computation is obtained by considering projecting each particle onto the subspace spanned by |1

and |2

. The state after projection is $\begin{matrix} {\rho_{2}^{\prime} = {{\frac{1}{A}\left\lbrack {{\frac{4\left( {1 - ɛ} \right)}{2^{n_{1} + n_{2}}}M_{4}} + {\frac{2ɛ}{2^{n_{1}}}\left( \frac{{\left. 1 \right\rangle\left. 1 \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle}}{\sqrt{2}} \right)\left( \frac{{\left\langle 1 \right.\left\langle 1 \right.} + {\left\langle 2 \right.\left\langle 2 \right.}}{\sqrt{2}} \right)}} \right\rbrack} =}} \\ {= {{\left( {1 - ɛ^{\prime}} \right)M_{4}} + {{ɛ^{\prime}\left( \frac{{\left. 1 \right\rangle\left. 1 \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle}}{\sqrt{2}} \right)}\left( \frac{{\left\langle 1 \right.\left\langle 1 \right.} + {\left\langle 2 \right.\left\langle 2 \right.}}{\sqrt{2}} \right)}}} \end{matrix},{where}$ $A = \left( {\frac{4\left( {1 - ɛ} \right)}{2^{n_{1} + n_{2}}} + \frac{2ɛ}{2^{n_{1}}}} \right)$ is the normalization factor, M₄ is the maximally mixed state in the four-dimensional Hilbert space spanned by |1

|1

, |1

|2

, |2

, |1

, |2

|2

, and $ɛ^{\prime} = {\frac{2ɛ}{2^{n_{1}}A} = {\frac{ɛ}{{\left( {1 - ɛ} \right)2^{{- n_{2}} + 1}} + ɛ}.}}$

Now a two qubit state of the form ${\left( {1 - \delta} \right)M_{4}} + {{\delta\left( \frac{{\left. 1 \right\rangle\left. 1 \right\rangle} + {\left. 2 \right\rangle\left. 2 \right\rangle}}{\sqrt{2}} \right)}\left( \frac{{\left\langle 1 \right.\left\langle 1 \right.} + {\left\langle 2 \right.\left\langle 2 \right.}}{\sqrt{2}} \right)}$ is entangled for δ>⅓. Therefore, the original state must have been entangled unless $\left. {ɛ^{\prime} \leq {1/3}}\Rightarrow{ɛ \leq \frac{1}{1 + 2^{n_{2}}}} \right.,$ since local projections cannot create entangled states from un-entangled ones.

Therefore, a computational protocol (for non-constant ƒ) involves starting with a mixed state and, if the state remains separable throughout the protocol, then $ɛ \leq {\frac{1}{1 + 2^{n_{2}}}.}$

However, even in favorable circumstances, a computation with noise ε takes of the order of 1/ε repetitions to get the correct answer with probability of the order of one.

Thus, computational protocols of the sort considered require exponentially-many repetitions. So no matter how efficient the original pure-state protocol is, the mixed-state protocol, which is sufficiently noisy that it remains separable for all N, will not transform an exponential classical problem into a polynomial one.

When |ψ

is entangled but ρ_(PPS) ^(n) is separable, the PPS exhibits pseudo-entanglement. The condition $ɛ < \frac{1}{1 + 2^{{2n} - 1}}$ is sufficient for separability but not necessary. Thus, entanglement will not appear in a quantum unitary computation that starts in a separable PPS whose purity parameter ε obeys $ɛ < {\frac{1}{1 + 2^{{2n} - 1}}.}$ A final measurement in the computational basis will not make entanglement appear either.

Two examples: the solutions of Deutsch-Jozsa and Simon's problems are now shown without entanglement.

For the Deutsch-Jozsa problem, given a function ƒ:

0,1

^(n)→

0,1

in the form of an oracle (or black-box), assume that either this function is promised to be either constant, ƒ(x)=ƒ(y), or that it is balanced, ƒ(x)=0, on exactly half the n-bit strings x. The task is to decide which is the case. A single oracle call (in which the input is given in superposition) suffices for a quantum computing to determine the answer with certainty, whereas no classical computing can be sure of the answer before it has asked 2^(n−1)+1 questions. More to the point, no information at all can be derived from the answer to a single classical oracle call.

The QA of Deutsch-Jozsa (DJ) solves this problem with a single query to the oracle by starting with state |0^(n)

|1

and performing a Walsh-Hadamard transform on all n+1 qubits before and after the application entanglement operator (quantum oracle) U_(ƒ). A measurement of the first n qubits is made at the end (in computational basis), yielding classical n-bit string z.

By virtue of phase kick-back, the initial Walsh-Hadamard transforms and the application of U_(ƒ)results in the following state: ${\left. 0^{n} \right\rangle\left. 1 \right\rangle}\overset{H}{\rightarrow}{{\left( {\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x}\left. x \right\rangle}} \right)\left.  - \right\rangle}\overset{U_{f}}{\rightarrow}{\left( {\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x}{\left( {- 1} \right)^{f{(x)}}\left. x \right\rangle}}} \right){\left.  - \right\rangle.}}}$

Then, if ƒ is constant, the final Walsh-Hadamard reverts the state back to ±|0^(n)

|1

, in which the overall phase is “+” if ƒ(x)=0 for all x and “−” if ƒ(x)=1 for all x. In either case, the result of the final measurement is necessarily z=0. On the other hand, if ƒ is balanced, the phase of half the |x

in the above expression is + and the phase of the other half is −. As a result, the amplitude of |0^(n)

is zero after the final Walsh-Hadamard transforms because each |x

is sent to ${{+ \frac{1}{\sqrt{2^{n}}}}\left. 0^{n} \right\rangle} + \ldots$ by those transforms.

Therefore, the final measurement cannot produce z=0. It follows from the promise that if z=0 it can be concluded that ƒ is constant and if z≠0, then it can be concluded that ƒ is balanced. Either way, the probability of success is 1 and the QA provides full information on the desired answer.

On the other hand, due to the special nature of the DJ-problem, a single query does not change the probability of guessing correctly whether the function is balanced or constant. Therefore, the following proposition holds: When restricted to a single DJ-oracle call, a classical computing algorithm learns no information about type of ƒ. In sharp contrast, the advantage of quantum computing even without entanglement: When restricted to a single DJ-oracle call, a quantum computing whose state is never entangled can learn a positive amount of information about the type of ƒ.

In this case, starting with a PPS in which the pure part is |0^(n)

|1

and its probability is ε, one can still follow the DJ-strategy, but now it becomes a guessing game. One can obtain the correct answer with different probabilities depending on whether ƒ is constant or balanced: If ƒ is constant, then z=0 with the probability ${P\left( {z = \left. 0 \middle| {f\quad{is}\quad{constant}} \right.} \right)} = {ɛ + \frac{1 - ɛ}{2^{n}}}$ because the algorithm started with state |0^(n)

|1

with probability ε, in which case DJ-QA is guaranteed to produce z=0 since ƒ is constant, or it started with a completely mixed state with complementary probability 1−ε, in which case DJ-QA produces a completely random z whose probability of being zero is 2^(−n).

Similarly, ${{P\text{(}z} \neq {0\left. {f\quad{is}\quad{constant}} \right)}} = {\left( {1 - ɛ} \right){\frac{2^{n} - 1}{2^{n}}.}}$

If ƒ is balanced one obtains a non-zero z with probability ${{{P\text{(}z} \neq {0\left. {f\quad{is}\quad{balanced}} \right)}} = {ɛ + {\left( {1 - ɛ} \right)\frac{2^{n} - 1}{2^{n}}}}},$ and z=0 is obtained with probability ${P\text{(}z} = {{0\left. {f\quad{is}\quad{balanced}} \right)} = {\frac{1 - ɛ}{2^{n}}.}}$

Therefore, for all positive ε and all n, an advantage is observed over classical computing.

In particular, this is true for ${ɛ < \frac{1}{1 + 2^{{2n} - 1}}},$ in which case the state remains separable throughout the entire computation in $ɛ < \frac{1}{1 + 2^{{2n} - 1}}$ with n+1 qubits.

An information analysis of the DJ problem without entanglement begins by assuming the a priori probability of ƒ being constant is p (and therefore, the probability that it is balanced is 1−p). The following diagrams describe the probability that zero (or non-zero) is measured, given a constant (or balanced) function, in pure and the totally mixed cases.

The case of pseudo-pure state is the weighted sum of the previous cases. The details of the pseudo-pure case are summarized in the joint probability Table 5.5. TABLE 5.5 Joint probability of function type (X) and measurement (Y) X y = zero y = non-zero constant $p\left( {ɛ + \frac{1 - ɛ}{2^{n}}} \right)$ ${p\left( {1 - ɛ} \right)}\left( {1 - \frac{1}{2^{n}}} \right)$ balanced $\left( {1 - p} \right)\frac{1 - ɛ}{2^{n}}$ $\left( {1 - p} \right)\left( {1 - \frac{1 - ɛ}{2^{n}}} \right)$ P(Y = y) $p_{0} = {{p\quad ɛ} + \frac{1 - ɛ}{2^{n}}}$ 1 − p₀

Thus, the probability p₀ of obtaining z=0 is ${ɛ \cdot p} + {\frac{1 - ɛ}{2^{n}}.}$ To quantify the amount of information gained about the function, given the outcome of the measurement, calculate the mutual information between X and Y, where X is a random variable signifying whether ƒ is constant or balanced, and Y is a random variable signifying whether z=0 or not. Let the entropy function of a probability q be h(q)≡−q log q−(1−q)log(1−q). The marginal probability of Y and X may be calculated from that table, and using Bayes rule, ${{P\text{(}X\left. Y \right)} = \frac{P\left( {Y\left. X \right){P(X)}} \right.}{P(Y)}},$ the conditional probabilities are ${{P\text{(}X} = {{{constant}\left. {Y = {zero}} \right)} = {\frac{p}{P_{0}}\left( {ɛ + \frac{1 - ɛ}{2^{n}}} \right)}}},{{P\text{(}X} = {{{constant}\left. {Y = {{non}\text{-}{zero}}} \right)} = {\frac{p\left( {1 - ɛ} \right)}{1 - P_{0}}\left( {1 - \frac{1}{2^{n}}} \right)}}}$ where $p_{0} = {{P\left( {Y = {zero}} \right)} = {{p\quad ɛ} + {\frac{1 - ɛ}{2^{n}}.}}}$

The conditional entropy is $\begin{matrix} {{H\text{(}X\left. Y \right)} = {\sum\limits_{y}{{P\left( {Y = y} \right)}{h\left( {P\left( {X = {{constant}\left. {Y = y} \right)}} \right)} \right.}}}} \\ {= {{p_{0}{h\left( {\frac{p}{p_{0}}\left\lbrack {ɛ + \frac{1 - ɛ}{2^{n}}} \right\rbrack} \right)}} + {\left( {1 - p_{0}} \right){{h\left( {\frac{p\left( {1 - ɛ} \right)}{1 - p_{0}}\left\lbrack {1 - \frac{1}{2^{n}}} \right\rbrack} \right)}.}}}} \end{matrix}$

Then, the mutual information gained by a single quantum query is $\begin{matrix} {{I\left( {X;Y} \right)} = {{H(X)} - {H\left( X\left. Y \right) \right.}}} \\ {= {{h(p)} - {p_{0}h\left( {\frac{p}{p_{0}}\left\lbrack {ɛ + \frac{1 - ɛ}{2^{n}}} \right\rbrack} \right)} -}} \\ {\left( {1 - p_{\quad 0}} \right){{h\left( {\frac{p\left( {1\quad - \quad ɛ} \right)}{1\quad - \quad p_{0}}\left\lbrack {1 - \frac{1}{\quad 2^{\quad n}}} \right\rbrack} \right)}.}} \end{matrix}$

The mutual information is positive for every ε>0, unless p=0 or p=1. This is more than the zero amount of information gained by a single classical query. For p=1/2 this reduced into $1 - {\frac{1 + {ɛ\left( {2^{n - 1} - 1} \right)}}{2^{n}}{h\left( \frac{1 + {ɛ\left( {2^{n} - 1} \right)}}{2\left( {1 + {ɛ\left( {2^{n} - 1} \right)}} \right)} \right)}} - {\frac{2^{n} - 1 - {ɛ\left( {2^{n - 1} - 1} \right)}}{2^{n}}{h\left( \frac{\left( {ɛ - 1} \right)\left( {2^{n} - 1} \right)}{2\left( {1 + {ɛ\left( {2^{n} - 1} \right)} - 2^{n}} \right)} \right)}}$ and, for very small ${ɛ\left( {{ɛ\bullet}\frac{1}{2^{n}}} \right)},$ using the fact that ${{h\left( {\frac{1}{2} + x} \right)} = {1 - \frac{2x^{2}}{\ln\quad 2} + {O\left( x^{4} \right)}}},$ this expression may be approximate by $\begin{matrix} {{I\left( {X\text{;}Y} \right)} = {1 - {p_{0}{h\left( {\frac{1}{2} + \frac{2^{n}ɛ}{4} + {O\left( {2^{n}ɛ^{2}} \right)}} \right)}} - {\left( {1 - p_{0}} \right)h}}} \\ {\left( {\frac{1}{2} + \frac{ɛ}{1 - \frac{4}{2^{n}}} + {O\left( {2^{n}ɛ^{2}} \right)}} \right)} \\ {= {{\frac{2^{2n}ɛ^{2}}{8\left( {2^{n} - 1} \right)\ln\quad 2} + {O\left( {2^{n}ɛ^{2}} \right)}} > 0}} \end{matrix}$

Consider, for example, the case when ${p = \frac{1}{2}},{n = {{3\quad{and}\quad ɛ} = {\frac{1}{1 + 2^{{2n} + 1}} = {\frac{1}{129}.}}}}$ In this case, I(X; Y)=0.0000972 bits of information are gained. Therefore, some information is gained even for separable PPSs, in contrast to the classical case where the mutual information is always zero. Furthermore, some information is gained even when ε is arbitrarily small.

It is possible to improve the expected amount of information that is obtained by a single call to the oracle by measuring the (n+1)-st qubit and take it into account. Indeed, this qubit should be |1

if the configuration comes from the pure part. Therefore, if that extra bit is |0

, which happens with probability $\frac{1 - ɛ}{2},$ it is known that the PPS contributes the fully mixed part, hence no useful information is provided by z and the situation is better than in the classical case. Indeed, when that extra bit is |1

, which happens with probability $\frac{1 + ɛ}{2},$ the probability of the pure part is enlarged from ε to ${\hat{ɛ} = \frac{2ɛ}{1 + ɛ}},$ and the probability of the mixed part is reduced from ${1 - {ɛ\quad{to}\quad 1} - \hat{ɛ}} = {\frac{1 - ɛ}{1 + ɛ}.}$ The probability of z=0 changes to ${\hat{p}}_{0} = {{p\hat{ɛ}} + \frac{1 - \hat{ɛ}}{2^{n}}}$ and mutual information to ${I\left( {X;Y} \right)} = {\frac{1 + ɛ}{2}\left\lbrack {{h(p)} - {{\hat{p}}_{0}{h\left( {\frac{p}{{\hat{p}}_{0}}\left\lbrack {\hat{ɛ} + \frac{1 - \hat{ɛ}}{2^{n}}} \right\rbrack} \right)}} - {\left( {1 - {\hat{p}}_{0}} \right){h\left( {\frac{p\left( {1 - \hat{ɛ}} \right)}{1 - {\hat{p}}_{0}}\left\lbrack {1 - \frac{1}{2^{n}}} \right\rbrack} \right)}}} \right\rbrack}$ which, for $p = \frac{1}{2}$ and very small ε, gives: ${I\left( {X;Y} \right)} = {{\frac{2^{2n}ɛ^{2}}{4\left( {2^{n} - 1} \right)\ln\quad 2} + {O\left( {2^{n}ɛ^{3}} \right)}} > 0.}$

This is essentially twice as much information as in the above case.

For the specific example of ${p = \frac{1}{2}},{n = {{3\quad{and}\quad ɛ} = \frac{1}{129}}},$ this is 0.000189 bits of information.

In the Simon algorithm, an oracle calculates a function ƒ(x) from n bits to n bits under the promise that ƒ is a two-to-one function, so that for any x there exists a unique y≠x such that ƒ(x)=ƒ(y). Furthermore, the existence of an s≠0 is promised such that ƒ(x)=ƒ(y) for y≠x iff y=x⊕s. The goal is to find s, while minimizing the number of times ƒ is calculated. Classically, even if one calls function ƒ exponentially many times, say ⁴ √{square root over (2 ^(n) )} times, the probability of finding s is still exponentially small with n that is less than $\frac{1}{\sqrt{2^{n}}}.$ However, there exists a QA that requires only O(n) computations of ƒ. The algorithm, due to Simon, is initialized with |0^(n)

|0^(n)

. It performs a Walsh-Hadamard transform on the first register and calculates ƒ for all inputs to obtain ${{\left. 0^{n} \right\rangle\left. 0^{n} \right\rangle}\overset{H}{\rightarrow}{{\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x}{\left. x \right\rangle\left. 0^{n} \right\rangle}}}\overset{U_{f}}{\rightarrow}{\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x}{\left. x \right\rangle\left. {f(x)} \right\rangle}}}}},$ which can be written as ${\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x}{\left. x \right\rangle\left. {f(x)} \right\rangle}}} = {\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x < {x \oplus s}}{\left( {\left. x \right\rangle + \left. {x \oplus s} \right\rangle} \right){f(x)}{\text{〉}.}}}}$

Then, the Walsh-Hadamard transform is performed again on the first register (the one holding the superposition of all |x

), which produces state $\frac{1}{\sqrt{2^{n}}}{\sum\limits_{x < {x \oplus s}}{\sum\limits_{j}{\left( {\left( {- 1} \right)^{j \cdot x} + \left( {- 1} \right)^{{j \cdot x} \oplus {j \cdot s}}} \right)\left. j \right\rangle{\left. {f(x)} \right\rangle.}}}}$

Finally, the first register is measured.

The outcome j is guaranteed to be orthogonal to s (j·s=0) since otherwise, |j

's amplitude (−1)^(j·x)(1+(1−)^(j·s)) is zero. After an expected number of such queries in O(n), one obtains n linearly independent j s that uniquely define s.

For example, let S be the random variable that describe parameter s, and let J be a random variable that describes the outcome of a single measurement. To quantify how much information about S is gained by a single query, assume that S is distributed uniformly in the range [1 . . . 2^(n)−1], its entropy before the first query is H(S)=1g(2^(n)−1)≈n. In the classical case, a single evaluation of ƒ gives no information about S: the value of ƒ(x) on any specific x says nothing about its value in different places, and therefore, nothing about s. However, in the case of the QA, one is assured that s and j are orthogonal. If the measured j is zero, s could still be any one of the (2^(n)−1) non-zero values and no information is gained. But in the overwhelmingly more probable case that j is non-zero, only (2^(n−1)−1) values for s are still possible. Thus, given the outcome of the measurement, the entropy of S drops to approximately n−1 bits and the expected information gain is nearly one bit.

In order to estimate the entropy, let S be a random variable that represents the sought-after parameter of Simon's function, so that ∀x: ƒ(x)=ƒ(x⊕s). Assume that S is distributed uniformly in the range [1 . . . 2^(n)−1]. Given that S=s, and starting with a PPS whose purity is ε, one can find the distribution of the measurement after a single query. With probability ε, one starts with the pure part and measures a j that is orthogonal to s. With probability 1−ε one starts with the totally mixed state and measures a random j . Thus, for j so that ${{j \cdot s} = 0},{{P\left( {J = {{j❘S} = s}} \right)} = {{ɛ\frac{2}{2^{n}}} + \frac{\left( {1 - ɛ} \right)}{2^{n\quad}}}},$ and for j so that ${{j \cdot s} = 1},{{P\left( {J = {{j❘S} = s}} \right)} = {{{\frac{\left( {1 - ɛ} \right)}{2^{n\quad}}.P}\left( {J = {{j❘S} = s}} \right)} = \left\{ {\begin{matrix} \frac{1 + ɛ}{2^{n}} & {{{if}\quad{j \cdot s}} = 0} \\ \frac{1 - ɛ}{2^{n}} & {{{if}\quad{j \cdot s}} = 1} \end{matrix}.} \right.}}$ Putting this together,

The marginal probability of J for any j≠0 is $\begin{matrix} {{P\left( {J = j} \right)} = {\sum\limits_{s}{{P(s)}{P\left( j \middle| s \right)}}}} \\ {= {\frac{1}{\quad 2^{\quad{n\quad - \quad 1}}}\left( {{\sum\limits_{s\quad\bot\quad j}{P\left( j \middle| s \right)}} + {\sum\limits_{s\quad{⫬ \quad{\bot\quad j}}}{P\left( j \middle| s \right)}}} \right)}} \\ {= \frac{{\left( \quad{2^{\quad{n\quad - \quad 1}}\quad - \quad 1} \right)\quad\frac{1\quad + \quad ɛ}{\quad 2^{\quad n}}}\quad + \quad{2^{\quad{n\quad - \quad 1}}\quad\frac{1\quad - \quad ɛ}{\quad 2^{\quad n}}}}{\quad{2^{\quad n}\quad - \quad 1}}} \\ {= {\quad\frac{1\quad - \quad\frac{1\quad + \quad ɛ}{\quad 2^{\quad n}}}{\quad{2^{\quad n}\quad - \quad 1}}}} \end{matrix},$ while for J=0, all values of s are orthogonal, and $\begin{matrix} {{P\left( {J = 0} \right)} = {\sum\limits_{s}{{P(s)}{P\left( {J = \left. 0 \middle| s \right.} \right)}}}} \\ {= {\frac{1}{2^{n} - 1}{\sum\limits_{s\bot j}{P\left( {J = \left. 0 \middle| s \right.} \right)}}}} \\ {= {\frac{1}{2^{n} - 1}\left( {2^{n - 1} - 1} \right)\frac{1 + ɛ}{2^{n}}}} \\ {= \frac{1 + ɛ}{2^{n}}} \end{matrix}.$

By the definition, the entropy of the random variable J is ${{H(J)} = {{- {\sum\limits_{j}{{P\left( {J = j} \right)}1{{gP}\left( {J = j} \right)}}}} = {{{- \left( {1 - \frac{1 + ɛ}{2^{n}}} \right)}1{g\left( \frac{1 - \frac{1 + ɛ}{2^{n}}}{2^{n} - 1} \right)}} - {\frac{1 + ɛ}{2^{n}}1g\frac{1 + ɛ}{2^{n}}}}}},$ and the conditional entropy of J given S=s is $\begin{matrix} {{H\left( {\left. J \middle| S \right. = s} \right)} = {- {\sum\limits_{j}{{P\left( {J = {\left. j \middle| S \right. = s}} \right)}1{{gP}\left( {J = {\left. j \middle| S \right. = s}} \right)}}}}} \\ {= {{{- 2^{n - 1}}\frac{1 + ɛ}{2^{n}}1{g\left( \frac{1 + ɛ}{2^{n}} \right)}} - {2^{n - 1}\frac{1 - ɛ}{2^{n}}1{g\left( \frac{1 - ɛ}{2^{n}} \right)}}}} \\ {= {{{- \frac{1 + ɛ}{2}}1{g\left( \frac{1 + ɛ}{2^{n}} \right)}} - {\frac{1 - ɛ}{2}1{g\left( \frac{1 - ɛ}{2^{n}} \right)}}}} \end{matrix}$

Since the above mentioned expression is independent of the specific values s, it also equals to H(S|J), which is $\sum\limits_{s}{{P\left( {S = s} \right)}{{H\left( {\left. J \middle| S \right. = s} \right)}.}}$ Finally, the amount of knowledge about S that is gained by knowing J is their mutual information: $\begin{matrix} {{I\left( {S;J} \right)} = {I\left( {J;S} \right)}} \\ {= {{H(J)} - {H\left( J \middle| S \right)}}} \\ {= {{{- \left( {1 - \frac{1 + ɛ}{2^{n}}} \right)}1{g\left( \frac{1 - \frac{1 + ɛ}{2^{n}}}{2^{n} - 1} \right)}} + {\left( {2^{n - 1} - 1} \right)\frac{1 + ɛ}{2^{n}}1g}}} \\ {{\frac{1 + ɛ}{2^{n}} + {\frac{1 - ɛ}{2}1{g\left( \frac{1 - ɛ}{2^{n}} \right)}}},} \end{matrix}$

Consider two extremes: in the pure case (=1), I(S;J)=1−O(2^(−n)) and in the totally mixed case (ε=0), I(S;J)=0.

Finally, it can be shown that for small ε the value ${I\left( {S;J} \right)} = {\frac{\left( {2^{n} - 2} \right)ɛ^{2}}{2\left( {2^{n} - 1} \right)1n\quad 2} + {{O\left( ɛ^{3} \right)}.}}$

More formally, based on the conditional probability ${P\left( {J = {\left. j \middle| S \right. = s}} \right)} = \left\{ {\begin{matrix} \frac{2}{2^{n}} & {{{if}\quad{j \cdot s}} = 0} \\ 0 & {{{if}\quad{j \cdot s}} = 1} \end{matrix},} \right.$ it follows that the conditional entropy H(J|S=s)=n−1, which does not depend on the specific s and, therefore, H(J|S)=n−1 as well. In order to find the a proiri entropy of J, calculate its marginal probability ${P\left( {J = j} \right)} = {{\sum\limits_{s}{{P(s)}{P\left( j \middle| s \right)}}} = \left\{ {\begin{matrix} \frac{1 - \frac{2}{2^{n}}}{2^{n} - 1} & {{{if}\quad j} \neq 0} \\ \frac{2}{2^{n}} & {{{if}\quad j} = 0} \end{matrix}.} \right.}$

Thus, $\begin{matrix} {{H(J)} = {- {\sum\limits_{j}{{P\left( {J = j} \right)}1{{gP}\left( {J = j} \right)}}}}} \\ {= {{{- \left( {1 - \frac{2}{2^{n}}} \right)}1g\frac{1 - \frac{2}{2^{n}}}{2^{n} - 1}} - {\frac{2}{2^{n}}1g\frac{2}{2^{n}}}}} \\ {= {{\left( {1 - \frac{2}{2^{n}}} \right)\left( {n + {1g\frac{2^{n} - 1}{2^{n} - 2}}} \right)} + \frac{n - 1}{2^{n - 1}}}} \end{matrix}$ and the mutual information ${I\left( {S;J} \right)} = {{1 - {\frac{2 - {\left( {2^{n} - 2} \right)ɛ^{2}}}{2^{n}}1g\frac{2^{n} - 1}{2^{n} - 2}}} = {1 - {O\left( {2 - n} \right)}}}$ is almost one bit.

In contrast, a single query to a classical oracle provides no information about s. When restricted to a single oracle call, a classical computing algorithm learns no information about Simon's parameter s. Again in sharp contrast, the following result shows the advantage of quantum computing without entanglement, compared to classical computing. When restricted to a single oracle call, a quantum computing algorithm whose state is never entangled can learn a positive amount of information about Simon's parameter s.

For example, starting with a PPS in which the pure part is |0^(n)

|0^(n)

, and its probability is ε, the acquired j is no longer guaranteed to be orthogonal to s. In fact, an orthogonal j is obtained with probability $\frac{1 + ɛ}{2}$ only. For any value of S, the conditional distribution of J as above mentioned is ${P\left( {J = {\left. j \middle| S \right. = s}} \right)} = \left\{ \begin{matrix} \frac{1 + ɛ}{2^{n}} & {{{if}\quad{j \cdot s}} = 0} \\ \frac{1 - ɛ}{2^{n}} & {{{if}\quad{j \cdot s}} = 1} \end{matrix} \right.$ from which it is calculated that the information gained about S given the value of J is ${I\left( {S;J} \right)} = {\quad{{{- \left( {1 - \frac{1 + ɛ}{2^{n}}} \right)}1{g\left( \frac{1 - \frac{1 + ɛ}{2^{n}}}{2^{n} - 1} \right)}} + {\left( {2^{n - 1} - 1} \right)\frac{1 + ɛ}{2^{n}}1g\frac{1 + ɛ}{2^{n}}} + {\frac{1 - ɛ}{2}1{{g\left( \frac{1 - ɛ}{2^{n}} \right)}.}}}}$

The amount of information is larger than the classical zero for every ε>0. This result is true even for ε as small as $\frac{1}{1 + 2^{{2{({2n})}} - 1}},$ in which case the state of the computing is never entangled throughout the computation.

When n=3 and ${ɛ = {\frac{1}{1 + 2^{{4 \cdot 3} - 1}} = \frac{1}{2049}}},$ 147×10⁻⁹ bits of information are gained. 5.3. Quantum Computing for Design of Robust Wise Control

Decomposition of the optimization process in design of a robust KB for an intelligent control system is separated in two steps: (1) global optimization based on a Quantum Genetic Search Algorithm (QGSA); and (2) a learning process based on a QNN for robust approximation of the teaching signal from a QGSA.

FIG. 40 shows the interrelations between Soft Computing and Quantum Soft Computing for simulation, global optimization, quantum learning and the optimal design of a robust KB in intelligent control systems. The main problem of KB-optimization based on soft computing lies in the design process using one solution space for global optimization. As an example, consider a design of a KB for a fixed class of stochastic excitations on a control object. If the design process is based on many solution spaces with different statistical characteristics of stochastic excitations of the control object, then the GA cannot necessarily find a global solution for an optimal KB. In this case, for global optimization, a QGSA is used to find the KB. In one embodiment, optimization methods of intelligent control system structures (based on quantum soft computing) use a modification of simulation methods for quantum computing.

Quantum Control Algorithm for Robust KB-FC Design.

FIG. 41 a is a block diagram of the structure of an intelligent control system based on a PD-fuzzy controller (PD-FC). In FIG. 41 a, a conventional PD (or PID) controller 4102 controls a plant 4103. A control output from the controller 4102 and an output from the plant 4103 are provided to a QGSA 4101. A globally optimized KB from the QGSA 4101 is provided to a Fuzzy Controller (FC) 4104. Gain schedules from the FC 4104 are provided to the PD controller 4102. An error signal, computed as a difference between an output of the plant 4103 and an input signal is provided to the FC 4104 and to the PD controller 4102.

Using a soft computing optimizer, it is possible to design partial KB(i) for the FC 4104 from simulation of control object behaviour using different classes of stochastic excitations. For many cases this KB(i) is not robust if another type of stochastic excitations is applied to the control object (plant) 4103 or if the reference signal is changed. The problem lies in design of a unified robust KB from a number of finite number KB(i) look-up tables created by soft computing and finding a globally optimized KB for intelligent fuzzy control under stochastic excitations.

The KB can be considered as an ordered DB containing control laws of coefficient gains for a traditional PID controller. The superposition operator is used for design of relations between coefficient gains of the PID-FC. Grover's QSA is used for searching of solutions and max operation between decoding states is analogy of the measurement process of solution search.

As described above, in an entanglement-free quantum computation no resource increases exponentially. The concrete example below shows that it is possible to design a robust intelligent globally-optimzed KB using a superposition of non-robust KBs. In this case, the quality of control based on the globally-optimized KB is more effective than the non-robust KBs obtained by local optimization. In this case, wise robust control is introduced, where wise≡intelligent⊕smart. This situation is similar to the Parrondo Paradox in a quantum game. In design process of wise control, entanglement is not used and thus, it is different from Parrondo Paradox.

For an entanglement-free quantum control algorithm for design of a robust wise KB-FC, consider one of the examples of quantum computing approach to design robust wise quantum control. As described, FIG. 41 a shows the structure of an intelligent control system based on a fuzzy PD-controller (PD-FC). A soft computing optimizer is used to a group of partial knowledge bases KB(i) for the PD-FC from fuzzy simulation of behavior of the plant 4103 using different class of stochastic excitations. For many cases, these KB( i) are not robust used with different type of stochastic excitations, changing initial states, or changing the type of reference signals. The problem lies in design of a unified robust globally optimized KB from the KB(i) look-up tables created by soft computing.

The entropy of an orthogonal matrix provides a new interpretation of Hadamard matrices as those that saturate the bound for entropy. This definition plays a role in QAs simulation, while the Hadamard matrix is used for preparation of superposition states and in entanglement-free QAs. The entropy of orthogonal matrices and Hadamard matrices (appropriately normalized) saturate the bound for the maximum of the entropy. The maxima (and other saddle points of the entropy function have an intriguing structure and yield generalizations of Hadamard matrices.)

Consider n random variables with a set of possible outcomes i=1, . . . , n having probabilities p_(i), i=1, . . . , n. Then ${\sum\limits_{i = 1}^{n}p_{i}} = 1$ and the Shannon entropy ${S^{Sh}\left( p_{i} \right)} = {- {\sum\limits_{i = 1}^{n}{p_{i}\ln\quad{p_{i}.}}}}$

Now define entropy of an orthogonal matrix O^(i) _(n), i, j=1, . . . , n. Here O^(i) _(j) are real numbers with the constraint ${\sum\limits_{i = 1}^{n}{O_{j}^{i}O_{k}^{i}}} = {\delta_{jk}.}$ In particular, the j th row of the matrix is a normalized vector for each i=1, . . . , n. It is possible to associate probabilities p_(j) ^((i))=(O^(i) _(j))² with the i th row, as ${\sum\limits_{j = 1}^{n}p_{j}^{(i)}} = 1$ for each i. Define the Shannon entropy for the orthogonal matrix as the sum of the entropies for each row: ${S^{Sh}\left( O_{j}^{i} \right)} = {- {\sum\limits_{i,{j = 1}}^{n}{\left( O_{j}^{i} \right)^{2}\ln\quad{\left( O_{j}^{i} \right)^{2}.}}}}$

The minimum value zero is attained by the identity matrix O^(i) _(j)=δ_(j) ^(i) and related matrices obtained by interchanging rows or changing the signs of the elements. The entropy of the i th row can have the maximum value In n, which is attained when each element of the row is $\pm {\frac{1}{\sqrt{n}}.}$ This gives the bound, S^(Sh)(O^(i) _(j))≦n 1n n.

In general the entropy of an orthogonal matrix cannot attain this bound because of the orthogonality constraint ${\sum\limits_{i = 1}^{n}{O_{j}^{i}O_{k}^{i}}} = \delta_{jk}$ which constraints p_(j) ^((i)) for different rows. In fact the bound is obtained only by the Hadamard matrices (rescaled by $\left. \frac{1}{\sqrt{n}} \right).$ This yields the criterion for the Hadamard matrices (appropriately normalized): those orthogonal matrices which saturate the bound for entropy.

The entropy is large when each element is as close to ${\pm \frac{1}{\sqrt{n}}}s$ possible, i.e., to a main diagonal. Thus, maximum entropy is similar to the maximum determinant condition of the Hadamard. The peaks of the entropy are isolated and sharp in contrast to the determinant.

For, example, a matrix that maximizes the entropy for n=3 is ${n = \left. 3\Rightarrow\begin{pmatrix} {- \frac{1}{3}} & \frac{2}{3} & \frac{2}{3} \\ \frac{2}{3} & {- \frac{1}{3}} & \frac{2}{3} \\ \frac{2}{3} & \frac{2}{3} & {- \frac{1}{3}} \end{pmatrix} \right.};$ $n = \left. 5\Rightarrow{\begin{pmatrix} {- \frac{3}{5}} & \frac{2}{5} & \frac{2}{5} & \frac{2}{5} & \frac{2}{5} \\ \frac{2}{5} & {- \frac{3}{5}} & \frac{2}{5} & \frac{2}{5} & \frac{2}{5} \\ \frac{2}{5} & \frac{2}{5} & {- \frac{3}{5}} & \frac{2}{5} & \frac{2}{5} \\ \frac{2}{5} & \frac{2}{5} & \frac{2}{5} & {- \frac{3}{5}} & \frac{2}{5} \\ \frac{2}{5} & \frac{2}{5} & \frac{2}{5} & \frac{2}{5} & {- \frac{3}{5}} \end{pmatrix}.} \right.$

For n =5, the result is similar as in the case n=3: the magnitudes of the elements in each row are $\frac{2}{5}$ repeated 4 times and a diagonal element is as $\left( {- \frac{3}{5}} \right).$

This set can be generalized for any n. The matrix with $- \frac{n - 2}{n}$ along the diagonal and each off-diagonal as $\frac{2}{n}$ is orthogonal. Each row is normalized as a consequence of the identity: n ²=(n−2)²+2²(n−1).

For each n, there are saddle points apart from maxima and minima.

For n=3 there is a saddle point and the corresponding matrix is $\begin{pmatrix} \frac{1}{2} & \frac{1}{\sqrt{2}} & \frac{1}{2} \\ \frac{1}{\sqrt{2}} & 0 & {- \frac{1}{\sqrt{2}}} \\ \frac{1}{2} & {- \frac{1}{\sqrt{2}}} & \frac{1}{2} \end{pmatrix}.$

The entropy peaks sharply at extrema. Thus, the entropy has a rich set of sharp extrema.

This result shows the role of the Hadamard operator in an entanglement-free QA: with the Hadamard transformation it is possible to introduce maximally-hidden information about classical basis independent states, and the superposition includes this maximal information. Thus, with superposition operator, it is possible to create a new QA without entanglement, while the superposition includes information about the property of the function ƒ.

FIG. 42 shows the structure of the design process for using the above approach in design of a robust KB for fuzzy controllers. The superposition operator used is the particular case of a QFT—the Walsh-Hadamard transform. The KB(i) of the PD-FC includes the set of coefficient gains K=

k_(p)(t), k_(D)(t)

laws received from soft computing simulation using different types of random excitations on the plant 4103. FIG. 43 shows the structure of a quantum control algorithm for design of a robust unified KB-FC from two KBs created by soft computing optimizer for Gaussian (KB(1)) and non-Gaussian (with Rayleigh probability density function)—KB(2) noises.

The algorithm includes the following operations:

-   -   1. Prepare two registers of n qubits in the state |0 . . . 0         εH_(N).     -   2. Apply H over the first register.     -   3. Apply diffusion (interference) operator G over the whole         quantum state.     -   4. Apply max operation over the first register.     -   5. Measure the first register and output the measured value.

Normalized real simulated coefficient gains

K_(p)(t),K_(D)(t)

can be calculated using the values of virtual coefficient gains

k_(P) ^(Q)(t),k_(D) ^(Q)(t)

as logical negation:

k_(P) ^(Q)(t),k_(D) ^(Q)(t)

=1−

k_(p)(t), k_(D)(t)

. For example, if the value of the proportional coefficient gain, k_(p)(t_(i)), is k_(p)(t_(i))=0,2, then k_(P) ^(Q)(t_(i))=1−0,2=0,8.

FIG. 41 b shows the geometrical interpretation of this computational process.

FIG. 42 shows the logical description of superposition between real and virtual values of coefficient gains created by soft computing simulation. For this case four classical states are joint in one non-classical superposition state with amplitude probability $\frac{1}{2}.$

For the above described example, the following coding result: |0₁

→0.2, |1₁

→0.8 is obtained.

In one embodiment, the computational control algorithm includes the following operations:

-   -   1. The current values (for fixed time t_(i)) of the coefficient         gains are coded as real values.     -   2. Hadamard matrices are created for superposition between real         simulated and virtual classical states. The virtual classical         state is calculated from the normalized scale [0,1] (the         complementary quantum law is the logical negation of the real         simulated value). The Hadamard transform joins two classical         states in one non-classical state as a superposition:         ${{\frac{1}{\sqrt{2}}\left\lbrack {\left. 0_{1} \right\rangle +} \right.}1_{1}\text{〉}\text{]}} = {{\frac{1}{\sqrt{2}}\left\lbrack {\left. {Yes} \right\rangle +} \right.}{No}\text{〉}\text{]}}$         that it is not found in classical mechanics. This operation         creates the possibility of extraction of hidden quantum         information from classical contradictory states.     -   3. Grover's diffusion operator is used to provide an         interference operation search for the solution.     -   4. The Max operation is applied to the classical states in the         superposition after the decoding of results.

The results of the quantum computation are used in new control laws (new coefficient gains) from two KB(i), i=1,2 created from soft computing technology {umlaut over (x)}+(x ²−1){dot over (x)}+x=k _(p)(t)e+k _(D)(t){dot over (e)}+ξ( t)   (4.1) under Gaussian random white noise ξ(t).

FIG. 44 b shows the initial control laws of the coefficient gains

k_(P)(t),k_(D)(t)

in a PD-FC created from soft computing technology for similar essentially non-linear control object such as a Van der Pol oscillator under non-Gaussian random noise with Rayleigh probability distribution.

FIG. 44 c shows the computational results of new coefficient gains of PD-FC based on the quantum control algorithm for similar essentially non-linear control objects such as the Van der Pol oscillator using KB's created from soft computing technology. FIG. 44 d shows the results of simulation of the dynamic behavior of the Van der Pol oscillator using PD-FC with different KBs.

The comparison of simulation results represented in FIG. 44 d shows the more robustness degree of quantum PD-FC than in similar classical soft computing cases as a new effect in intelligent control system design. From two non-robust KBs of PD-FCs, one robust KB of PD-FC with quantum computation approach can be designed. This effect is similar to the effect in the above mentioned quantum Parrondo Paradox in quantum game theory, but without using of entanglement.

The comparison of simulation results represented in FIG. 45 shows the higher degree of robustness in quantum PD-FC than in similar classical soft computing cases as a new effect in intelligent control system design.

6. Model Representations of Quantum Operators in Fast QAs

In some cases, the speed of the QA simulation can be improved by using a model representation of the quantum operators. This approach is based on using new operations or adding to existing quantum operators in the QSA structure, and/or structural modifications of the quantum operators in QSA. Grover's algorithm is used as an example herein. One of ordinary skill in the art will recognize that the model representation technique is not limited to Grover's algorithm.

6.1 Grover's QSA Structure with New Additional Quantum Operators

FIG. 46 shows the addition of a new Hadamard operator, for example, between the oracle (entanglement) and the diffusion operators in Grover's QSA. The new Hadamard operator is applied on a workspace qubit (for complementing superposition and changing sign) to produce an algorithm labeled QSA1. Let M denote the number of matches within the search space such that 1≦M≦N, and for simplicity, and without loss of generality, assume that N=2^(n). For this case one can describe the steps of the algorithm as follows. Step Computational operation 1 $\begin{matrix} {{Register}\quad{{preparation}:\quad{{{Prepare}\quad a\quad{quantum}\quad{register}\quad{of}\quad n} + {1\quad{qubits}}}}} \\ {{{all}\quad{in}\quad{state}\quad\left. 0 \right\rangle},{{where}\quad{the}\quad{extra}\quad{qubit}\quad{is}\quad{used}\quad{as}\quad a\quad{workspace}\quad{for}}} \\ {\left. {{\left. {{{\left. {{{{evaluating}\quad{the}\quad{oracle}\quad{U_{f}:}}}W_{0}} \right\rangle =}}0} \right\rangle^{\otimes n}}0} \right\rangle.} \end{matrix}\quad$ 2 Register initialization: Apply Hadamard gate on each of the first n qubits in parallel, so they contain the 2^(n) states, where i is the integer representation of items in the list: $\left. {{{\left. \left. {\left. {{{\left. {W_{1}} \right\rangle = \left( {H^{\otimes n} \otimes I} \right)}}W_{0}} \right\rangle = {\left( {\frac{1}{\sqrt{N}}\sum\limits_{i = 0}^{N - 1}} \right.i}} \right\rangle \right) \otimes}}0} \right\rangle,{N = {2^{n}.}}$ 3 Applying oracle: Apply the oracle U_(f) to map the items in the list to either 0 or 1 simultaneously and store the result in the extra workspace qubit: $\left. \left. {{{{\left. {\left. {{{\left. {W_{2}} \right\rangle = U_{f}}}W_{1}} \right\rangle = {\frac{1}{\sqrt{N}}{\sum\limits_{i = 0}^{N - 1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \oplus {f(i)}} \right\rangle \right) = {\frac{1}{\sqrt{N}}{\sum\limits_{i = 0}^{N - 1}{\left( {\left. i \right\rangle \otimes \left. {f(i)} \right\rangle} \right).}}}$ 4 Completing superposition and changing sign: Apply a Hadamard gate on the workspace qubit. This will extend the superposition for the n + 1 qubits with the amplitudes of the desired states with negative sign as follows: $\begin{matrix} {\left. {{{\left. {W_{3}} \right\rangle = \left( {I^{\otimes n} \otimes H} \right)}}W_{2}} \right\rangle =} \\ {\left. {{{\left. {\frac{1}{\sqrt{N}}{\sum\limits_{i = 0}^{N - 1}{\left(  \right.i}}} \right\rangle \otimes}}\left\lbrack \frac{\left. {{{\left. 0 \right\rangle + \left( {- 1} \right)^{f{(i)}}}}1} \right\rangle}{\sqrt{2}} \right\rbrack} \right),\quad{{P = {{2N} = 2^{n + 1}}}.}} \end{matrix}\quad$ 5 Inversion about the mean: $\left. {{{{\left. {D = {{{H^{{\otimes n} + 1}\left( {{2\left. 0 \right\rangle\left\langle 0 \right.} - I} \right)}H^{{\otimes n} + 1}} = {2{\psi }}}} \right\rangle\left\langle {\psi{{{- I},}}\psi} \right\rangle} = {\frac{1}{\sqrt{P}}\sum\limits_{k = 0}^{P - 1}}}}k} \right\rangle,$ $\begin{matrix} {\left. \left. {{{\left. {\left. \left. {{{\left. {\left. {{{\left. {W_{4}} \right\rangle = D}}W_{3}} \right\rangle = {b{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + {a{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}1} \right\rangle \right) +} \\ {\left. \left. {{{\left. {\left. \left. {{{\left. {b{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}} \right\rangle \otimes}}0} \right\rangle \right) + {a{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}}} \right\rangle \otimes}}1} \right\rangle \right),} \end{matrix}\quad$ ${a = {\frac{1}{\sqrt{P}}\left( {3 - \frac{4M}{P}} \right)}};{b = {\frac{1}{\sqrt{P}}\left( {1 - \frac{4M}{P}} \right)}};{{{Ma}^{2} + {\left( {P - M} \right)b^{2}}} = 1.}$ 6 Measurement: Measure the first n qubits, to obtain the desired solution after first iteration $\begin{matrix} {{with}\quad{probability}\quad P_{s}^{(1)}\quad{to}\quad{find}\quad a\quad{match}\quad{out}} \\ {{of}\quad{the}\quad M\quad{possible}\quad{matches}\quad{as}\quad{{follows}:}} \end{matrix}\quad$ ${P_{s} = {{M\left( {a^{2} + b^{2}} \right)} = {{5\quad r} - {8\quad r^{2}} + {4\quad r^{3}}}}},{{r = \frac{M}{N}};}$ with probability P_(ns) to find undesired result out of the states as follows: P_(ns) = (P − 2M)b², where P_(s) + P_(ns) = M(a² + b²) + (P − 2M)b² = 1.

Consider the particular properties of QSA1. In Step 5 of QSA1 it is assumed that indicates a sum over all i, which are desired matches (2M states), and Σ₂ indicates a sum over all i, which are undesired items in the list. Thus, the state |W₃> of QSA1 can be rewritten as follows: $\begin{matrix} {\left. W_{3} \right\rangle = {\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}\left( {{\left. i \right\rangle \otimes \left. 0 \right\rangle} + {\left( {- 1} \right)^{f{(i)}}\left. 1 \right\rangle}} \right)}}} \\ {= {{\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}\left( {\left. i \right\rangle \otimes \left\lbrack {\left. 0 \right\rangle - \left. 1 \right\rangle} \right\rbrack} \right)}} + {\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}{\,_{2}\left( {\left. i \right\rangle \otimes \left\lbrack {\left. 0 \right\rangle + \left. 1 \right\rangle} \right\rbrack} \right)}}}}} \\ {= {{\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}{\,_{1}\left( {\left. i \right\rangle \otimes \left. 0 \right\rangle} \right)}}} - {\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}{\,_{1}\left( {{\left. i \right\rangle \otimes \left. {1\text{〉}} \right)} +} \right.}}}}} \\ {{\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}{\,_{2}\left( {\left. i \right\rangle \otimes \left. 0 \right\rangle} \right)}}} + {\frac{1}{\sqrt{P}}{\sum\limits_{i = 0}^{N - 1}{\,_{1}\left( {\left. i \right\rangle \otimes \left. {1\text{〉}} \right)} \right.}}}} \end{matrix}$ There are M states with amplitude $\left( {- \frac{1}{\sqrt{P}}} \right)$ where f (i)=1, and (P−M) states with amplitude $\left( \frac{1}{\sqrt{P}} \right).$

Applying the Hadamard gate on the extra qubit splits the |i> state (solution states), to M states $\left( {\sum\limits_{i = 0}^{N - 1}{\,_{1}\left( {\left. i \right\rangle \otimes \left. 0 \right\rangle} \right)}} \right)$ with positive amplitude $\left( \frac{1}{\sqrt{P}} \right)$ and Mstates $\left( {\sum\limits_{i = 0}^{N - 1}{\,_{1}\left( {\left. i \right\rangle \otimes \left. 1 \right\rangle} \right)}} \right)$ with negative amplitude $\left( {- \frac{1}{\sqrt{P}}} \right).$

In step 5, the effect of applying the (Grover's) diffusion operator D on the general state ${\sum\limits_{k = 0}^{P - 1}{\alpha_{k}\left. k \right\rangle\quad{produces}\quad{\sum\limits_{k = 0}^{P - 1}{\left\lbrack {{- \alpha_{k}} + {2\left\langle \alpha \right\rangle}} \right\rbrack\left. k \right\rangle}}}},{where}$ $\left\langle \alpha \right\rangle = {\frac{1}{P}{\sum\limits_{k = 0}^{P - 1}\alpha_{k}}}$ (operation of inversion about the mean) is the mean of the amplitudes of all states in the superposition; i.e., the amplitudes α_(k) will be transformed according to the following relation: α_(k)→[−α_(k)+2<α>]. In the discussed case, there are M states with amplitude $\left( {- \frac{1}{\sqrt{P}}} \right)$ and (P−M) states with amplitude $\left( \frac{1}{\sqrt{P}} \right),$ so the mean <α> is as follows: $\left\langle \alpha \right\rangle = {{\frac{1}{P}\left( {M\left( {- \frac{1}{\sqrt{P}}} \right)} \right)} + {\left( {P - M} \right){\left( \frac{1}{\sqrt{P}} \right).}}}$ So, applying D on the system |W₃>, described in step 5 of QSA1, can be understood as follows:

-   (i) The M negative sign amplitudes (solutions) will be transformed     from $\left( {- \frac{1}{\sqrt{P}}} \right)$     to α, where α is calculated as follows: $\begin{matrix}     {a = {{- \left( {- \frac{1}{\sqrt{P}}} \right)} + {\frac{2}{P}\left\lbrack {{M\left( {- \frac{1}{\sqrt{P}}} \right)} + {\left( {P - M} \right)\left( \frac{1}{\sqrt{P}} \right)}} \right\rbrack}}} \\     {= {{\frac{1}{\sqrt{P}}\left( {3 - \frac{4M}{P}} \right)}.}}     \end{matrix}$ -   (ii) The (P−M) positive sign amplitudes will be transformed from     $\left( \frac{1}{\sqrt{P}} \right)$     to b, where b is calculated as follows: $\begin{matrix}     {b = {{- \left( \frac{1}{\sqrt{P}} \right)} + {\frac{2}{P}\left\lbrack {{M\left( {- \frac{1}{\sqrt{P}}} \right)} + {\left( {P - M} \right)\left( \frac{1}{\sqrt{P}} \right)}} \right\rbrack}}} \\     {= {{\frac{1}{\sqrt{P}}\left( {1 - \frac{4M}{P}} \right)}.}}     \end{matrix}$ -   Then, a>b after applying D, and the new system state |W₄> can be     written as step 5 of QSA1. If no matches exist within the     superposition (i.e., M=0), then all the amplitudes will have a     positive sign and applying the diffusion operator D will not change     the amplitudes of the states as follows: -   Substituting $\alpha_{k} = {\frac{1}{\sqrt{P}}\quad{and}}$     $\left\langle \alpha \right\rangle = {\frac{1}{P}\left( {P\left( \frac{1}{\sqrt{P}} \right)} \right)}$     in the relation α_(k)→[−α_(k)+2<α>] gives     $\left. {\alpha_{k} + {2\left\langle \alpha \right\rangle}}\rightarrow{\frac{1}{\sqrt{P}} + {\frac{2}{P}\left( {P\left( \frac{1}{\sqrt{P}} \right)} \right)}} \right. = {\frac{1}{\sqrt{P}} = {\alpha_{k}.}}$

It is possible to produce a second quantum algorithm QSA2 by modifying the structure of the diffusion operator D→D_(part) in step 5 of the modified QSA1 on the partial diffusion operator D_(part) which can work similar to the well-known Grover's operator D except that it performs the inversion about the mean operation only on a subspace of the system. The diagonal representation of the partial diffusion operator D_(part), when applied on n+1 qubits system, can take this form: D→D_(part)=H^({circle around (×)}n+1){circle around (×)}I(2|0><0|−I)H^({circle around (×)}n+1){circle around (×)}I, where the vector |0> used in this operation is a vector of length P=2N=2^(n+1). FIG. 47 shows the steps of QSA2.

The steps of the modified QSA2 can be understood as follows: Step Computational operation 1 $\begin{matrix} {{Register}\quad{{preparation}:\quad{{{Prepare}\quad a\quad{quantum}\quad{register}\quad{of}\quad n} + {1\quad{qubits}}}}} \\ {{{all}\quad{in}\quad{state}\quad\left. 0 \right\rangle},{{where}\quad{the}\quad{extra}\quad{qubit}\quad{is}\quad{used}\quad{as}\quad a\quad{workspace}\quad{for}}} \\ {\left. {{\left. {{{\left. {{{{evaluating}\quad{the}\quad{oracle}\quad{U_{f}:}}}W_{0}} \right\rangle =}}0} \right\rangle^{\otimes n}}0} \right\rangle.} \end{matrix}\quad$ 2 Register initialization: Apply Hadamard gate on each of the first n qubits in parallel, so they contain the 2^(n) states, where i is the integer representation of items in the list: $\left. {{{\left. \left. {\left. {{{\left. {W_{1}} \right\rangle = \left( {H^{\otimes n} \otimes I} \right)}}W_{0}} \right\rangle = {\left( {\frac{1}{\sqrt{N}}\sum\limits_{i = 0}^{N - 1}} \right.i}} \right\rangle \right) \otimes}}0} \right\rangle,{N = {2^{n}.}}$ 3 Applying oracle: Apply the oracle U_(f) to map the items in the list to either 0 or 1 simultaneously and store the result in the extra workspace qubit: $\begin{matrix} {\left. \left. {{{{\left. {\left. {{{\left. {W_{2}} \right\rangle = U_{f}}}W_{1}} \right\rangle = {\frac{1}{\sqrt{N}}{\sum\limits_{i = 0}^{N - 1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \oplus {f(i)}} \right\rangle \right) =} \\ \left. \left. {{{\left. {{\frac{1}{\sqrt{N}}{{\sum\limits_{i = 0}^{N - 1}}_{2}\left( {\left. i \right\rangle \otimes \left. 0 \right\rangle} \right)}} + {\frac{1}{\sqrt{N}}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}1} \right\rangle \right) \end{matrix}\quad$ 4 $\begin{matrix} {{Partial}\quad{{diffusion}:\quad{{Applying}\quad D_{{part}\quad}\quad{on}\quad\left. W_{2} \right\rangle\quad{will}\quad{result}\quad{in}\quad a}}} \\ {{new}\quad{system}{\quad\quad}{described}\quad{as}\quad{{follows}:}} \end{matrix}\quad$ $\begin{matrix} {\left. \left. {{{\left. {\left. {{{\left. {W_{3}} \right\rangle = D_{part}}}W_{2}} \right\rangle = {a_{1}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) +} \\ {\left. \left. {{{\left. {{b_{1}{{\sum\limits_{i = 0}^{N - 1}}_{1}\left( {\left. i \right\rangle \otimes \left. 0 \right\rangle} \right)}} + {c_{1}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}1} \right\rangle \right),} \end{matrix}\quad$ ${{a_{1} = {{2\left\langle \alpha_{1} \right\rangle} - \frac{1}{\sqrt{N}}}};\quad{b_{1} = {2\left\langle \alpha_{1} \right\rangle}};}\quad$ ${{c_{1} = {- \frac{1}{\sqrt{N}}}};\quad{\left\langle \alpha_{1} \right\rangle = \left( \frac{N - M}{N\sqrt{N}} \right)}},\quad{and}$ (N − M)a₁² + Mb₁² + Mc₁² = 1 5 Measurement: Measure the first n qubits, to obtain the desired solution after the iteration $\begin{matrix} {{1.\quad{with}\quad{probability}\quad P_{s}^{(1)}\quad{to}\quad{find}\quad a\quad{match}\quad{out}}\quad} \\ {{of}\quad{the}\quad M\quad{possible}\quad{matches}\quad{as}\quad{{follows}:}} \end{matrix}\quad$ ${P_{s}^{(1)} = {{M\left( {b_{1}^{2} + c_{1}^{2}} \right)} = {{5\quad r} - {8\quad r^{2}} + {4\quad r^{3}}}}},{{r = \frac{M}{N}};}$ 2. with probability P_(ns) to find undesired result out of the states as follows: P_(ns)⁽¹⁾ = (N − M)a₁²,  where  P_(s)⁽¹⁾ + P_(ns)^((s)) = 1.

One aspect of using the partial diffusion operator in searching is to apply the inversion about the mean operation only on the subspace of the system that includes all the states which represent the non-matches and half the number of the states which represent the matches, while the other half will have the sign of their amplitudes inverted. This inversion to the negative sign prepars them to be involved in the partial diffusion operation in the next iteration so that the amplitudes of the matching states get amplified partially in each iteration. The benefit of this is to keep half the number of the states, which represent the matches as a stock each iteration to resist the de-amplification behavior of the diffusion operation when reaching the turning points as seen when examining the performance of the modified QSA2. In step 5 of modified QSA2 applying D_(part) can be understood as follows: without loss of generality, the general system ${\sum\limits_{k = 0}^{P - 1}{\delta_{k}\left. k \right\rangle}},{{\delta_{k}{^{2}{= 1}}}}$ can be rewritten as ${{\sum\limits_{k = 0}^{P - 1}{\delta_{k}\left. k \right\rangle}} = {{\sum\limits_{j = 0}^{N - 1}{\alpha_{j}\left( {\left. j \right\rangle \otimes \left. 0 \right\rangle} \right)}} + {\sum\limits_{j = 0}^{N - 1}{\beta_{j}\left( {\left. j \right\rangle \otimes \left. 1 \right\rangle} \right)}}}},$ where

α_(j)=δ_(k):k even

and

β_(j)=δ_(k):k odd

, and then applying D_(part) on the system gives $\begin{matrix} {{D_{part}\left( {\sum\limits_{k = 0}^{P - 1}{\delta_{k}\left. k \right\rangle}} \right)} = {\left\lbrack {{H^{{\otimes n} + 1} \otimes {I\left( {{2\left. 0 \right\rangle\left\langle 0 \right.} - I} \right)}}{H^{{\otimes n} + 1} \otimes I}} \right\rbrack\left( {\sum\limits_{k = 0}^{P - 1}{\delta_{k}\left. k \right\rangle}} \right)}} \\ {= {{{2\left\lbrack {{H^{{\otimes n} + 1} \otimes {I\left( {2\left. 0 \right\rangle\left\langle 0 \right.} \right)}}{H^{{\otimes n} + 1} \otimes I}} \right\rbrack}\left( {\sum\limits_{k = 0}^{P - 1}{\delta_{k}\left. k \right\rangle}} \right)} - \left( {\sum\limits_{k = 0}^{P - 1}{\delta_{k}\left. k \right\rangle}} \right)}} \\ {{= {{\sum\limits_{j = 0}^{N - 1}{\left\lbrack {{2\left\langle \alpha \right\rangle} - \alpha_{j}} \right\rbrack\left( {\left. j \right\rangle \otimes} \right.0\text{〉}\text{)}}} - {\sum\limits_{j = 0}^{N - 1}{\beta_{j}\left( {\left. j \right\rangle \oplus \left. 1 \right\rangle} \right)}}}},} \end{matrix}$ where $\left\langle \alpha \right\rangle = {\frac{1}{N}{\sum\limits_{j = 0}^{N - 1}\alpha_{i}}}$ is the mean of the amplitudes of the subspace ${\sum\limits_{j = 0}^{N - 1}{{{\alpha_{j}\left( {j} \right\rangle} \otimes \text{}}0\text{〉}\text{)}}};$ i.e., applying the operator D_(part) will perform the version about the mean only on the subspace $\sum\limits_{j = 0}^{N - 1}{{{\alpha_{j}\left( {j} \right\rangle} \otimes \text{}}0\text{〉}\text{)}}$ and will only change the sign of the amplitudes for the rest of the system as $\sum\limits_{j = 0}^{N - 1}{{{\beta_{j}\left( {j} \right\rangle} \otimes \text{}}1\text{〉}{\text{)}.}}$

FIG. 48 shows one embodiment of a circuit implementation using elementary gates. The probability of finding a solution varies according to the number of matches M≠0 in the superposition.

Consider the performance of the modified QSA1 and QSA2 after iterating the algorithm once. Table 6.1 shows the results of probability calculations. The maximum probability is always 1, and minimum probability (worst case) decreases as the size of the list increases, which is expected for small M≠0 because the number of states will increase, and the probability is distributed over more states, while the average probability increases as the size of the list increases. TABLE 6.1 Algorithm performance with different size search space n, N = 2^(n) Max probability Min probability Average probability 2 1 0.8125 0.875 3 1 0.507812 0.93750 4 1 0.282227 0.96875 5 1 0.148560 0.984375 6 1 0.076187 0.992187

In the measurement process in step 6 of QSA1, for the first iteration, $\begin{matrix} {P_{s\quad 1}^{(1)} = {M\left( {a_{1}^{2} + b_{1}^{2}} \right)}} \\ {= {\frac{M}{2N}\left( {10 - {16\left( \frac{M}{N} \right)} + {8\left( \frac{M}{N} \right)^{2}}} \right)}} \\ {{= {{5r} - {8r^{2}} + {4r^{3}}}},r} \\ {= {\frac{M}{N}.}} \end{matrix}$ The above equation implies that the average performance of the algorithm to find a solution increases as the size of the list increases. Taking into account that the oracle U_(f) is taken as a black, box, one can define the average probability of success average(P_(s)) of the algorithm as follows: $\begin{matrix} {{{average}\left( P_{s} \right)} = {{\frac{1}{2^{N}}{\sum\limits_{M = 1}^{N}{{{}_{\quad}^{}{}_{}^{\quad}}P_{s}}}} = {\frac{1}{2^{N}}{\sum\limits_{M = 1}^{N}{\frac{N!}{{M!}{\left( {N - M} \right)!}} \cdot {M\left( {a^{2} + b^{2}} \right)}}}}}} \\ {= {\frac{1}{2^{N + 1}N^{3}}{\sum\limits_{M = 1}^{N}{\frac{N!}{{\left( {M - 1} \right)!}{\left( {N - M} \right)!}} \cdot}}}} \\ {\left( {{10N^{2}} - {16{MN}} + {8M^{2}}} \right) = {{1 - \frac{1}{2N}}.}} \end{matrix}$ where ${{}_{\quad}^{}{}_{}^{\quad}} = \frac{N!}{{M!}{\left( {N - M} \right)!}}$ is the number of possible cases for M matches. As the size of the list increases (N→∞), average (P_(s)) tends to 1.

For QSA2 in step 5, the following relations hold: $\begin{matrix} {{{average}\left( P_{s\quad 2}^{(1)} \right)} = {{\frac{1}{2^{N}}{\sum\limits_{M = 1}^{N}{{{}_{\quad}^{}{}_{}^{\quad}}P_{s}}}} = {\frac{1}{2^{N}}{\sum\limits_{M = 1}^{N}{\frac{N!}{{M!}{\left( {N - M} \right)!}} \cdot {M\left( {b_{1}^{2} + c_{1}^{2}} \right)}}}}}} \\ {= {\frac{1}{2^{N + 1}N^{3}}{\sum\limits_{M = 1}^{N}{\frac{N!}{{\left( {M - 1} \right)!}{\left( {N - M} \right)!}} \cdot}}}} \\ {\left( {{10N^{2}} - {16{MN}} + {8M^{2}}} \right) = {1 - \frac{1}{2N}}} \end{matrix}$ where ${{}_{\quad}^{}{}_{}^{\quad}} = \frac{N!}{{M!}{\left( {N - M} \right)!}}$ is the number of possible cases for M matches. As the size of the list increases (N→∞), average (P_(s)) for both QSA ½ tends to 1.

Classically, one can try to find a random guess of the item, which represents the solution (one trial guess), and succeed to find a solution with probability $P_{s}^{({Classical})} = {\frac{M}{N}.}$ The average probability can be calculated as follows: $\begin{matrix} {{{average}\left( P_{s}^{({Classical})} \right)} = {\frac{1}{2^{N}}{\underset{M = 1}{\overset{N}{\sum^{N}}}{C_{M}P_{s}^{({Classical})}}}}} \\ {= {\frac{1}{2^{N}}{\sum\limits_{M = 1}^{N}\frac{M \cdot N}{{M!}{\left( {N - M} \right)!}}}}} \\ {= {\frac{1}{2}.}} \end{matrix}$ This means that there is an average probability of one-half to find (or not to find) a solution by a single random guess, even with the increase in the number of matches.

Grover's QSA has an average probability one-half after an arbitrary number of iterations. The probability of success of Grover's QSA after l iterations is given by: ${P_{s}^{({{Gr}{\lbrack l\rbrack}})} = {\sin^{2}\left( {\left( {{2l} + 1} \right)\theta} \right)}},{{{{where}\quad 0} < \theta < {\frac{\pi}{2}{and}\quad\sin\quad\theta}} = {\sqrt{\frac{M}{N}}.}}$ The average probability of success of Grover's QSA after an arbitrary number of iteration can be calculated as follows: $\begin{matrix} {{{average}\left( P_{s}^{({{Gr}{\lbrack l\rbrack}})} \right)} = {\frac{1}{2^{N}}{\underset{M = 1}{\overset{N}{\sum^{N}}}{C_{M}{\sin^{2}\left( {\left( {{2l} + 1} \right)\theta} \right)}}}}} \\ {= {\frac{1}{2}.}} \end{matrix}$

FIG. 49 shows the probability of success of the three algorithms as a function of the ratio $r = \frac{M}{N}$ for the first iteration of Grover's QSA. FIG. 49 shows that the probability of success of the modified QSA1 is always above that of the classical guess technique. Grover's QSA solves the case where $M = \frac{N}{4}$ with certainty, and the modified QSA1 solves the case where $M = \frac{N}{2}$ with certainty. The probability of success of Grover's QSA will start to go below one-half for ${M > \frac{N}{2}},$ while the probability of success of the modified QSA1 will stay more reliable with a probability of at least 92.6%.

FIG. 50 shows the iterating version of the algorithm QSA1 that works as follows: Step Computational algorithm 1 Initialize the whole n + 1 qubits system to the state |0>. 2 (i) Apply Hadamard gate on each of the first n qubits in parallel. 3 Iterate the following, for iteration k: Apply the oracle U_(f) taking the first n qubits as control qubits and the k th qubit workspace as the target qubit exclusively (ii) Apply Hadamard gate on the k th qubit workspace (iii) Apply diffusion operator on the whole n + k qubit system inclusively 4 Apply measurement on the first n qubits

The second iteration modifies the system as follows: Step Results after second QSA1-iteration 1 Append second qubit workspace to the system: $\begin{matrix} {\left. {\left. \left. {\left. {\left. \left. {\left. W_{1}^{(2)} \right\rangle = {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes {0}} \right\rangle + {a_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes {0}} \right\rangle +} \\ {\left. \left. {{\left. \left. {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.0}}} \right\rangle \right) \otimes \left. 0 \right\rangle} + {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 0 \right\rangle} \end{matrix}\quad$ 2 Apply U_(f) as shown in Step 3-(i) of QSA1: $\begin{matrix} {{\left. \left. {{\left. \left. {\left. W_{2}^{(2)} \right\rangle = {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} + {a_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {\left. \left. {{\left. \left. {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.0}}} \right\rangle \right) \otimes \left. 0 \right\rangle} + {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 0 \right\rangle} \end{matrix}\quad$ 3 Apply  Hadamard  gate  on  second  qubit  workspace  (I^(⊗n + 1) ⊗ H): $\begin{matrix} {{\left. \left. {\left. {\left. \left. \quad{\left. W_{3}^{(2)} \right\rangle = {\frac{1}{\sqrt{2}}b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes {1}} \right\rangle - {\frac{1}{\sqrt{2}}b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {{\left. \left. {\left. {\left. \left. {\frac{1}{\sqrt{2}}a_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.1}}} \right\rangle \right) \otimes {0}} \right\rangle - {\frac{1}{\sqrt{2}}a_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {{\left. \left. {\left. {\left. \left. {\frac{1}{\sqrt{2}}b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.0}}} \right\rangle \right) \otimes {0}} \right\rangle + {\frac{1}{\sqrt{2}}b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {\left. \left. {\left. {\left. \left. {\frac{1}{\sqrt{2}}b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.1}}} \right\rangle \right) \otimes {0}} \right\rangle + {\frac{1}{\sqrt{2}}b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} \end{matrix}\quad$ 4. Apply diffusion operator as shown in Step 3-(iii) of QSA1: $\begin{matrix} {{\left. \left. {{\left. \left. \quad{\left. W_{4}^{(2)} \right\rangle = {b_{0}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes \left. 0 \right\rangle} + {b_{1}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {{\left. \left. {{\left. \left. {a_{0}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.1}}} \right\rangle \right) \otimes \left. 0 \right\rangle} + {a_{0}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {{\left. \left. {\left. {\left. \left. {b_{0}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.0}}} \right\rangle \right) \otimes {0}} \right\rangle + {b_{0}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.0}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} +} \\ {\left. \left. {\left. {\left. \left. {b_{0}^{(1)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.1}}} \right\rangle \right) \otimes {0}} \right\rangle + {b_{0}^{(2)}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left( {\left. i \right\rangle \otimes} \right.1}}}} \right\rangle \right) \otimes \left. 1 \right\rangle} \end{matrix}\quad$

Where the mean of the amplitudes to be used in the diffusion operator is calculated as follows: $\left\langle \alpha_{2} \right\rangle = {{\frac{1}{2^{n + 2}}\left\lbrack {\left( {2^{n + 2} - {4M}} \right)\frac{b_{0}^{(1)}}{\sqrt{2}}} \right\rbrack} = {\frac{b_{0}^{(1)}}{\sqrt{2}}{\left( {1 - {4M}} \right).}}}$

To clear ambiguity, a and b used in the above section for first iteration are denoted as a₀ ⁽¹⁾ and b₀ ⁽¹⁾ respectively, where the superscript index denotes the iteration and subscript index is used to distinguish amplitudes.

The new amplitudes a₀ ⁽²⁾, a₁ ⁽²⁾ b₀ ⁽²⁾, b₁ ⁽²⁾ are calculated as follows: ${a_{0}^{(2)} = {{2\left\langle \alpha_{2} \right\rangle} - {\frac{1}{\sqrt{2}}a_{0}^{(1)}}}};{a_{1}^{(2)} = {{2\left\langle \alpha_{2} \right\rangle} + {\frac{1}{\sqrt{2}}a_{0}^{(1)}}}}$ ${b_{0}^{(2)} = {{2\left\langle \alpha_{2} \right\rangle} - {\frac{1}{\sqrt{2}}b_{0}^{(1)}}}};{b_{1}^{(2)} = {{2\left\langle \alpha_{2} \right\rangle} + {\frac{1}{\sqrt{2}}{b_{0}^{(1)}.}}}}$

The probability of success is: P_(s) ⁽²⁾=M[(a₀ ⁽²⁾)²+(a₁ ⁽²⁾)²+(b₀ ⁽²⁾)²+(b₁ ⁽²⁾)²].

In general, after e iterations, the recurrent relations representing the iteration can be written as follows: for the initial conditions ${a_{0}^{(0)} = {b_{0}^{(0)} = \frac{1}{\sqrt{N}}}},$

-   1. The mean to be used in the diffusion operator is:     ${\left\langle \alpha_{2} \right\rangle = {\frac{b_{0}^{({l - 1})}}{\sqrt{2}}\left( {1 - {4M}} \right)}};{l \geq 1}$ -   2. The new amplitudes of the system are:     ${a_{0}^{(1)} = {{2\left\langle \alpha_{2} \right\rangle} + {\frac{1}{\sqrt{2}}a_{0}^{(0)}}}};{a_{0->{2^{l - 1} - 1}}^{(2)} = {{2\left\langle \alpha_{l} \right\rangle} \mp {\frac{1}{\sqrt{2}}a_{0->{2^{l - 2} - 1}}^{({l - 1})}}}};{l \geq 2}$     ${b_{0}^{(1)} = {{2\left\langle \alpha_{2} \right\rangle} - {\frac{1}{\sqrt{2}}b_{0}^{(0)}}}};{b_{0->{2^{l - 1} - 1}}^{(2)} = {{2\left\langle \alpha_{l} \right\rangle} \mp {\frac{1}{\sqrt{2}}b_{0->{2^{l - 2} - 1}}^{({l - 1})}}}};{l \geq 2}$ -   3. The probability of success for l≧2 is:     P _(s) ^((l)) =M[(a _(i) ^((l)))²+(b _(i) ^((l)))² ];i=0,1,2, . . .     ,2⁻¹−1     or, using mathematical induction, the probability of success can     take the following form:     ${P_{s}^{(l)} = {{\left( {\frac{M}{N} - 1} \right)\left( {1 - \frac{M}{N}} \right)^{2l}} + 1}},{l \geq 1.}$

FIG. 51 shows the iterating version of the QSA2 algorithm. The iterating block applies the oracle U_(f) and the operator D_(part) on the system in sequence. Consider the system after the first iteration, a second iteration modifies the system as follows: Step Results after second QSA2-iteration 1 Apply the oracle U1 will swap the amplitudes of the states which represent only the matches; i.e., states with amplitudes b₁ will be with amplitudes c₁, and states with amplitudes c₁ will be with amplitudes b₁, so the system can be described as: $\left. \left. {{{\left. {\left. \left. {{{\left. {\left. \left. {{{\left. {\left. {W_{4}} \right\rangle = {a_{1}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + {c_{1}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + {b_{1}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}1} \right\rangle \right)\quad$ 2 Applying the operator D_(part) will change the system as follows: $\left. \left. {{{\left. {\left. \left. {{{\left. {\left. \left. {{{\left. {\left. {W_{5}} \right\rangle = {a_{2}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + \quad{b_{2}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + {c_{2}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}1} \right\rangle \right){\quad,}$ where the mean used in the definition of partial diffusion operator D_(part) is: a: $\left\langle \alpha_{2} \right\rangle = {\frac{1}{N}\left\lbrack {{\left( {N - M} \right)a_{1}} + {Mc}_{1}} \right\rbrack}$ and a₂, b₂, c₂ used in this Step 2 of the second iteration are calculated as follows: a₂ = 2⟨α₂⟩ − a₁; b₂ = 2⟨α₂⟩ − c₁; c₂ = −b₁

And for the third iteration Step Results after third QSA2-iteration 1 Apply the oracle U_(f) will swap the amplitudes of the states which represent only the matches as: $\begin{matrix} {\left. \left. {{{\left. {\left. \left. {{{\left. {\left. {{{\left. {{U_{f}}W_{5}} \right\rangle =}}W_{6}} \right\rangle = {a_{2}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + {c_{2}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + \quad} \\ \left. \left. {{{\left. {b_{2}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}} \right\rangle \otimes}}1} \right\rangle \right) \end{matrix}\quad$ 2 Applying the operator D_(part) will change the system as follows: $\begin{matrix} {\left. \left. {{{\left. {\left. \left. {{{\left. {\left. {{{\left. {{D_{{part}\quad}}W_{6}} \right\rangle =}}W_{7}} \right\rangle = {a_{3}{{\sum\limits_{i = 0}^{N - 1}}_{2}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) + {b_{3}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}}} \right\rangle \otimes}}0} \right\rangle \right) +} \\ {\left. \left. {{{\left. {c_{3}{{\sum\limits_{i = 0}^{N - 1}}_{1}{\left(  \right.i}}} \right\rangle \otimes}}1} \right\rangle \right),} \end{matrix}\quad$ where the mean used in the definition of partial diffusion operator D_(part) is as: ${\left\langle \alpha_{3} \right\rangle = {\frac{1}{N}\left\lbrack {{\left( {N - M} \right)a_{2}} + {Mc}_{2}} \right\rbrack}},$ and a₃, b₃, c₃ in this Step 2 of the third iteration are calculated as follows: a₃ = 2⟨α₃⟩ − a₂; b₃ = 2⟨α₃⟩ − c₂; c₃ = −b₂

In general, the system of QSA2 after l≧2 iterations can be described using the following recurrence relations: ${{❘{W^{(l)}\text{〉}}} = {{a_{l}{\underset{i = 0}{\overset{N - 1}{\sum_{2}}}\left( {{{i{\text{〉} \otimes}}}0\text{〉}} \right)}} + {b_{l}{\underset{i = 0}{\overset{N - 1}{\sum_{1}}}\left( {{{i{\text{〉} \otimes}}}0\text{〉}} \right)}} + {c_{l}{\underset{i = 0}{\overset{N - 1}{\sum_{1}}}\left( {{{i{\text{〉} \otimes}}}1\text{〉}} \right)}}}},$ where the mean to be used in the definition of the partial diffusion operator D_(part) is as follows: ${\left\langle \alpha_{l} \right\rangle = \left\lbrack {{ya}_{l - 1} + {\left( {1 - y} \right)c_{l - 1}}} \right\rbrack},{y = {1 - r}},{r - \frac{M}{N}},{{{and}\quad a_{l}} = {s\left( {F_{l} - F_{l - 1}} \right)}},{b_{l} = {sF}_{l}},{c_{l} = {{{- {sF}_{l - 1}}{and}{\quad\quad}{F_{l}(y)}} = \frac{\sin\left( {\left\lbrack {l + 1} \right\rbrack\theta} \right)}{\sin(\theta)}}},{s = \frac{1}{\sqrt{N}}},$ where F_(l)(y) is the Chebyshev polynomials of the second kind.

The probabilities of the system are: ${P_{s}^{(l)} = {\left( {1 - {\cos(~\theta)}} \right)\left\lbrack {F_{l}^{2} + F_{l - 1}^{2}} \right\rbrack}},{P_{n\quad s}^{(l)} = {{\cos(\theta)}\left\lbrack {F_{l} - F_{l - 1}} \right\rbrack}^{2}},{y = {\cos(\theta)}},{0 \leq \theta \leq \frac{\pi}{2}},$ such that P_(s) ^((l))+P_(ns) ^((l))=1.

It is instructive to calculate how many iterations, l, are required to find the matches with certainty or near certainty for different cases of 1≦M≦N. To find a match with certainty on any measurement, then P_(s) ^((l)) must be as close as possible to certainty.

For interations of the algorithm QSA1, consider the following cases using equation ${P_{s}^{(l)} = {{\left( {\frac{M}{N} - 1} \right)\left( {1 - \frac{M}{N}} \right)^{2l}} + 1}},{l \geq 1.}$ The number of iterations W in terms of the ratio $r = \frac{M}{N}$ is represented using Taylor's expansion as follows: ${l \geq \frac{P_{S}^{(l)} - r}{4{r\left( {1 - r} \right)}}},{r = {\frac{M}{N}.}}$

The cases where multiple instances of a match exist within the search space are listed as follows: 1 $\begin{matrix} {{{The}\quad{case}\quad{where}\quad M} = {\frac{1}{2}{N:{{The}\quad{algorithm}\quad{can}\quad{find}\quad a\quad{solution}}}}} \\ {{{with}\quad{certainty}{\quad\quad}{after}\quad{arbitrary}\quad{number}\quad{of}\quad{iterations}}\quad} \\ \left( {{one}\quad{iteration}\quad{is}\quad{enough}} \right) \end{matrix}\quad$ 2 $\begin{matrix} {{{{The}\quad{case}\quad{where}\quad M} > {\frac{1}{2}N:{{The}\quad{probability}\quad{of}\quad{success}\quad{is}}}},} \\ {{{for}\quad{instance}},{{at}\quad{least}\quad 92.6\%\quad{after}\quad{the}\quad{first}\quad{iteration}},\quad{95.9\%\quad{after}}} \\ {{{second}\quad{iteration}},{{and}\quad 97.2\%\quad{after}\quad{third}\quad{iteration}}} \end{matrix}\quad$ 3 $\begin{matrix} {{For}\quad{iterating}\quad{the}\quad{algorithm}\quad{once}\quad\left( {\ell = 1} \right)\quad{and}\quad{to}\quad{get}\quad a\quad{probability}} \\ {{{of}\quad{at}\quad{least}\quad{one}\text{-}{half}},{so},{{M\quad{must}\quad{satisfy}\quad{the}\quad{condition}\quad M} > {\frac{1}{8}N}}} \end{matrix}\quad$

For the case where l≧1, the following conditions must be satisfied: n≧4 and $1 \leq M \leq {\frac{1}{8}{N.}}$ This means that the first iteration will cover approximately 87.5% of the problem with a probability of at least one-half; two iterations will cover approximately 91.84% and three iterations will cover 94.2%. The rate of increase of the coverage range will decrease as the number of iterations increases.

For the algorithm QSA2 to find a match with certainty on any measurement, then P_(s) ^((l)) must be as close as possible to certainty. In this case, consider the following relation: P_(s) ^((l))=1=(1−cos(θ))[F_(l) ²+F_(l-1) ²], ${y = {\cos\quad(\theta)}},{0 \leq \theta \leq {\frac{\pi}{2}.{Then}}},\quad{l = {{\frac{\pi\quad - \quad\theta}{\quad{2\quad\theta}}\quad{or}\quad\theta} = {\frac{\pi}{\quad 2}.}}}$ Using this result, and since the number of iterations must be an integer, then the required number of iterations is ${l = \left\lfloor {\frac{\pi}{2\sqrt{2}}\sqrt{\frac{N}{M}}} \right\rfloor},$ where └ ┘ is the floor operation. The algorithm runs in ${O\left( \sqrt{\frac{N}{M}} \right)}.$

The probability of success of Grover's QSA is as follows: P_(S) ^((l-Gr))=sin²[(2l_(Gr)+1)θ], where ${{\sin^{2}(\theta)} = \frac{M}{N}};{0 \leq \theta \leq \frac{\pi}{2}}$ and the required l_(Gr) is $l_{Gr} = {\left\lfloor {\frac{\pi}{4}\sqrt{\frac{N}{M}}} \right\rfloor.}$

FIG. 52 shows the probability of success of the iterative version of the algorithm QSA1 where l=1, 2, . . . , 6. This algorithm needs $O\left( \frac{N}{M} \right)$ iterations for n≧4 and ${1 \leq M \leq {\frac{1}{8}N}},$ which is similar to classical algorithms behavior. This leads to the conclusion that the first few iterations of the algorithm will provide the best performance and that there will be no substantial gain from continuing to iterate the algorithm.

By contrast, Grover's QSA needs $O\left( \sqrt{\frac{N}{M}} \right)$ to solve the problem, but its performance decreases for $M \geq {\frac{1}{2}{N.}}$ Thus, for the case when the number of solutions M is known in advance, for ${1 \leq M \leq {\frac{1}{8}N}},$ one can use Grover's QSA with ${O\left( \sqrt{\frac{N}{M}} \right)};$ and if ${\frac{1}{8}N} \leq M \leq N$ use QSA1 with O(1).

FIG. 53 shows that Grover's QSA is faster in the case of fewer instances of the solution $\left( {{{ratio}\quad r} = {\frac{M}{N}\quad{is}\quad{small}}} \right)$ and the algorithm QSA1 is more stable and reliable in case of multiple instances of the solution.

Thus, Grover's QSA performs well in the case of fewer instances of the solution, and the performance decreases as the number of solutions increase within the search space; the algorithm QSA1 in general performs better than any pure classical or QSA and still has O√{square root over (N)} for the hardest case and approximately O(1) for $M \geq {\frac{1}{8}{N.}}$

For QSA2, the probability of success is as follows: $\begin{matrix} {{P_{s}^{(l)} = {\left( {1 - {\cos\quad(\theta)}} \right)\left\lbrack {F_{l}^{2} + F_{l - 1}^{2}} \right\rbrack}},{F_{l}(y)}} \\ {{= \frac{\sin\quad\left( {\left\lbrack {l + 1} \right\rbrack\theta} \right)}{\sin\quad(\theta)}},} \end{matrix}$ and $\begin{matrix} {P_{s}^{(l)} = {\left( {1 - {\cos\quad(\theta)}} \right)\left\lbrack {F_{l}^{2} + F_{l - 1}^{2}} \right\rbrack}} \\ {{= {\left( {1 - {\cos(\theta)}} \right)\left\lbrack \frac{{\sin^{2}\quad\left( {\left\lbrack {l + 1} \right\rbrack\theta} \right)} + {\sin^{2}\left( {l\quad\theta} \right)}}{\sin^{2}\quad(\theta)} \right\rbrack}},} \end{matrix}\quad$ where ${{\cos(\theta)} = {1 - \frac{M}{N}}};{0 \leq \theta \leq \frac{\pi}{2}}$ and the required l is $l = {\left\lfloor {\frac{\pi}{2\sqrt{2}}\sqrt{\frac{N}{M}}} \right\rfloor.}$

FIG. 54 shows the probability of success as a function of the ratio $r = \frac{M}{N}$ for both algorithms. For QSA2 the probability will never return to zero once started, and the minimum probability will increase as M increases because of the use of the partial diffusion operator D_(part), which will resist the de-amplification when reaching the turning points as explained in the definition of the partial diffusion operator D_(part); i.e., the problem becomes easier for multiple matches, whereas for Grover's QSA, the number of cases (points) to be solved with certainty is equal to the number of cases with zero-probability after arbitrary number of iterations.

FIG. 55 shows the probability of success as a function of the ratio $r = \frac{M}{N}$ for both algorithms by inserting the calculated number of iterations l_(Gr) and l in P_(S) ^((l-Gr)) and P_(S) ^((l)), respectively. The minimum probability that Grover's QSA can reach is approximately 17.5% when ${r = {\frac{M}{N} = 0.617}},$ while for QSA2, the minimum probability is 87.7% when $r = {\frac{M}{N} = {0.31.}}$ The behavior of QSA2 is similar in this case to the behavior of this algorithm of the first iteration shown in FIG. 55 for ${r = {\frac{M}{N} > 0.31}},$ which implies that if ${r = {\frac{M}{N} > 0.31}},$ then QSA2 runs in O(1), i.e.; the problem is easier for multiple matches.

Thus, using modifications in the quantum operators of Grover's QSA structure, both QSA1 and QSA2, based on QAG-approach, perform more reliably than Grover's QSA in the case of fewer matches (e.g., relatively hard cases) and runs in O(1) in the case of multiple matches (e.g., relatively easy cases).

6.2. Modification of the Superposition Operator in Grover's QSA: Wavelet QSA with Partial Information.

Before applying of Grover's QSA, a bisection between a database and quantum states is necessary. If a superposition of N states is initially prepared, the Grover's QSA amplifies the amplitude of the target state up to around one, while those of other states dwindle down to nearly zero. The amplitude amplification is perfomed by two inversion operations: inversion about the target by the oracle and inversion about the initial state by the Fourier transform. Two simultaneous reflections about two mirrors crossing by an angle α induces a 2α rotation. One can imagine that the inversion in the Grover's QSA rotates the initial state around the target state. If the target state and initial state are denoted by |w> and |ψ>, respectively (here the initial state is prepared by the Fourier transform of a state |k>, i.e.; |ψ>=(FT)|k>), the inversion operators are expressed as O_(|w>)=I−2|w><w|,J_(|w>)=I−2|ψ><ψ|. Since J_(|w>)=(FT)J_(|k>)(FT)^(†),the Grover operator is written as G=(FT)J_(|k>)(FT)^(†)O_(|w>).Then, after applying the operator O(√{square root over (N)}) times, the final state comes to |ψ_(fin)>=G^(O(√{square root over (N)}))(FT)|k>. The probability to obtain the target state is Pr(w)=|<w||ψ_(fin)>>|², which is 1−ε², ε□1. The query complexity of this QSA, the number of callings of the oracle, is therefore O(√{square root over (N)}). The running time has nothing to do with the choice of |k>.

When partial information is given in an unstructed database, one can replace the Fourier transform in Grover's QSA with the Haar wavelet transform. In this case, if a partial information L is given to an unstructed database of size N, then there is an improved speed-up of ${O\left( \sqrt{\frac{N}{L}} \right)}.$

Grover's QSA cannot benefit from the partial information. The fast wavelet WQSA, which is a modification of Grover's QSA can solve this problem by replacing the Fourier transform with the Haar wavelet transform.

The state W^(†)|2^(λ-1)+j> is a superposition of $\frac{N}{L}$ states, where L=2^(λ-1) (λ is given by k) is the partial information about an initial state, while the state (FT)|k> is a superposition of N states. Since the operator is composed of wavelet transforms, the initial state is prepared by applying the inverse wavelet transform W^(†) to a state |k>. The initial state is now |ψ>=W^(†)|k>. The power of the WQSA appears in the initialization procedure.

The Haar wavelet transform W is represented by the sequence of sparse matrices W=W_(n)W_(n-1) . . . W₁, where ${W_{k} = \begin{bmatrix} H_{2^{n - k + 1}} & O_{2^{n - k + 1} \times {({2^{n} - 2^{n - k + 1}})}} \\ O_{{({2^{\quad n} - 2^{\quad{n\quad - \quad k\quad + \quad 1}}})} \times 2^{n - k + 1}} & I_{2^{n} - 2^{n - k + 1}} \end{bmatrix}},{and}$ ${H_{2^{n}} = \begin{bmatrix} 1 & 1 & 0 & 0 & \cdots & 0 \\ 0 & 0 & 1 & 1 & 0 & \vdots \\ \vdots & \vdots & \quad & ⋰ & 0 & 0 \\ 0 & 0 & \cdots & 0 & 1 & 1 \\ 1 & {- 1} & 0 & 0 & \cdots & 0 \\ 0 & 0 & 1 & {- 1} & 0 & \quad \\ \vdots & \vdots & \quad & ⋰ & 0 & 0 \\ 0 & 0 & \cdots & 0 & 1 & {- 1} \end{bmatrix}_{2^{k} \times 2^{k}}},$ where H₂ _(n) is the Haar 1-level decomposition operator, I_(n) is used as the n×n unit matrix, and O_(n,m) as n×m zero matrix. The wavelet transform W is unitary, since the operator H₂ _(n) is unitary.

One of ordinary skill in the art will recognize that other wavelet transforms can be applied to the WQSA. The Haar wavelet transform is described by sparse matrix, and it is observed that the first half of the Haar wavelet basis differs from the second half of the wavelet basis by the phase exp

iπ

. This implies that the destructive and constructive interference between states accepts a set of states containing the target and rejects the other states.

In this sense, other known wavelet bases, e.g., Daubechies's, the discrete Hartle transform as $A_{N} = {{\left( \frac{1 - i}{2} \right)({FT})_{N}} + {\left( \frac{1 + i}{2} \right)({FT})_{N}^{3}}}$ or the fractional discrete Fourier transform as an α-th root of (FT)_(N) is ${F_{N,\alpha} = {{{a_{0}(\alpha)} \cdot 1_{N}} + \ldots + {{a_{3}(\alpha)} \cdot ({FT})_{N}^{3}}}},{{a_{0}(\alpha)} = {\frac{1}{2}\left( {1 + {\mathbb{e}}^{{\mathbb{i}}\quad\alpha}} \right)\cos\quad\alpha}},{{a_{1}(\alpha)} = {\frac{1}{2}\left( {1 - {\mathbb{i}\mathbb{e}}^{\mathbb{i}\alpha}} \right)\sin\quad\alpha}},{{a_{2}(\alpha)} = {\frac{1}{2}\left( {{- 1} + {\mathbb{e}}^{\mathbb{i}\alpha}} \right)\cos\quad\alpha}},{{a_{3}(\alpha)} = {\frac{1}{2}\left( {{- 1} - {\mathbb{i}\mathbb{e}}^{\mathbb{i}\alpha}} \right)\sin\quad\alpha}}$ are not appropriate to play the role of selecting a subset of the N states.

The operator G^((W))=−W^(†)J_(|k>)WO_(|W>) is one iteration of the WQSA. The expected runing time is ${O\left( \sqrt{\frac{N}{L}} \right)}.$

For example, consider the problem of finding a desired one in the set A=

|a>|a=1,2,3, . . . ,2^(n-1)

. Given a partial information that the target state is in the subset A_(λ) ^(j)=

|z>|(j−1)2^(n-λ)≦z≦j2^(n-λ)−1,1<j≦2^(λ)

, one can complete the search task in O(√{square root over (2^(n-λ+1))}) times by choosing the initial state as W^(†)|2^(λ-1)+j>. Only the λ-number is correctly labeled. The partial information may save this problem. Thus, the power of WQSA appears in the initialization procedure.

Consider the case of partial information about k as k≠0,1. Choosing the initial state as |ψ>=W^(†)|k>, k≠0,1 when the target state exists in the restricted domain of the $\frac{N}{L}$ states gives an improved speed-up with the partial information.

Since kε

2,3,4, . . . ,N(=2^(n))−1

, by setting k=2^(λ-1)+j,1≦j≦2^(λ) and λ≧1, and ${N_{1} = {\frac{N}{L} = 2^{n - \lambda + 1}}},$ the initial state |ψ>=W^(†)|k>, k≠0,1 is explicitly, $\left. {\left. {\left. W^{\dagger} \middle| k \right\rangle = {\sum\limits_{\alpha = {{({j - 1})}N_{1}}}^{{{({j - 1})}N_{1}} + \frac{N_{1}}{N_{2}} - 1}\left| \alpha \right.}} \right\rangle - \sum\limits_{\beta = {{{({j - 1})}N_{1}} + \frac{N_{1}}{N_{2}}}}^{{jN}_{1} - 1}} \middle| \beta \right\rangle.$

Let the target state be |w>εA_(λ) ^(j) and the initial state be W^(†)|2^(λ-1)+j>.It suffices to show that it takes O(√{square root over (2 ^(n-λ+1) )}) times for the WQSA to find the target state with the following setting.

Let $N_{1} = {\frac{N}{L} = 2^{n - \lambda + 1}}$

and the wavelet search operator is G^((W))=−W^(†)J_(|k>)WO_(|w>), where W^(†) is the Haar wavelet transform. Step Computational wavelet algorithm 1 Applying  the  operator  W^(†)  to  thek⟩, gives  the  initial  state $\left. {{{\left. {{{\left. \psi \right\rangle = W^{\dagger}}}k} \right\rangle = {{\sum\limits_{\alpha = {{({j - 1})}N_{1}}}^{{{({j - 1})}N_{1}} + \frac{N_{1}}{2} - 1}\left. \alpha \right\rangle} - \sum\limits_{\beta = {{{({j - 1})}N_{1}} + \frac{N_{1}}{2}}}^{{jN}_{1} - 1}}}}\beta} \right\rangle,$ $\begin{matrix} {{{which}\quad{can}\quad{be}\quad{written}\quad{as}\quad{{follows}:\left. \psi \right\rangle}} =} \\ {{{\frac{ɛ_{w}}{\sqrt{N_{1}}}\left. w \right\rangle} + {ɛ_{r}\sqrt{\frac{N_{1} - 1}{N_{1}}}\left. r \right\rangle}},\quad{where}} \end{matrix}\quad$ $\begin{matrix} {{ɛ_{i}\quad\varepsilon\left\{ {\pm 1} \right\}\quad{and}\quad{the}\quad{state}\quad\left. r \right\rangle} =} \\ {\left. {{{\sqrt{\frac{1}{N_{1} - 1}}{\sum\limits_{\gamma \neq w}ɛ_{\gamma}}}}\gamma} \right\rangle\quad{is}\quad{orthogonal}\quad{complement}\quad{of}\quad{the}\quad{target}\quad{{state}.}} \end{matrix}\quad$ 2 $\begin{matrix} {{{The}\quad m\quad{iterations}\quad{of}\quad{the}\quad{operator}\quad G^{(W)}} = {{- W^{\dagger}}J_{k\rangle}{WO}_{{w}\rangle}\quad{create}\quad{the}}} \\ \left. {{{{{following}\quad{state}\text{:}\quad\left. \psi_{m} \right\rangle} = G_{m}^{(W)}}}\psi} \right\rangle \end{matrix}\quad$ 3 The probability to obtain the target state after the m iterations is P_(m) = ⟨wψ_(m)⟩² = cos²(m  θ − φ), ${{where}\quad\theta} = {{{\sin^{- 1}\left( {2ɛ_{w}ɛ_{r}\frac{\sqrt{N_{1} - 1}}{N_{1}}} \right)}\quad{and}\quad\varphi} = {{\cos^{- 1}\left( \frac{ɛ_{w}}{N_{1}} \right)}.}}$

Thus, the total number of iterations is O(√{square root over (2 ^(n-λ+1) )}). If we denote N=2^(n) and L=2^(λ-1), then the running time is written as ${O\left( \sqrt{\frac{N}{L}} \right)}.$

The partial information that the λ-th number j is correctly labeled leads to the application of the WQSA so that the reference section is filled in time. However, note that there is no improvement in running time when the initial state is W^(†)|0> or W^(†)|1), since, in this case, the initial state is still a superposition of N states. Therefore, from the proposition, one can complete the submission in time if the λ is larger than 2.

The described construction provides a way for a quantum search to benefit from partial information. Since the running time of the Grover's QSA has nothing to do with the choice of the unitary operator, the complexity of the WQSA is the same as the Grover QSA. However, the speed-up obtained from the WQSA is $O\left( \sqrt{\frac{N}{L}} \right)$ and is obtained by preparing the initial state as follows: |ψ>=W^(†)|k>. The running time of the WQSA depends on the choice of k, while that the Grover's QSA does not. This is because the state |ψ>=W^(†)|k> is a superposition of states in the restricted domain of $\frac{N}{L}$ states. Therefore, given a partial information L to a unstructured database of size N, there is an improved speed-up of ${O\left( \sqrt{\frac{N}{L}} \right)}.$ 7. Comparison of Different QA Simulation Approaches

FIG. 56 shows comparison of the developed approaches of QA simulation. In case of Grover's QSA FIG. 56 a, shows results from four simulation methods. It is clear that simulation results according with each method are same, but temporal complexity and size of the data base may vary depending on the approach. Direct matrix based approach is more simple, but the qubit number is limited to 12 qubits, since operator matrices are allocated in PC memory. The second approach with algorithmic replacement of the quantum gates permits an increase in the degree of the analyzed function (number of qubits) up to 20 or more. The problem-oriented approach permits quantum gate applications operating directly with the state vector. This permits an exponential decrease in the number of multiplications, and as a result, allows running of Grover's algorithm on a PC. With this approach, it is possible to allocate in PC memory a state vector containing 25-26 qubits. An extreme version of the Grover's QSA is an approach when the state vector is allocated as a sparse matrix, taking in consideration that with an absence of decoherence, most of the values of the probability amplitudes are equal, and as a result there is no need to store of all of the state vector, but only the different parts, which is equal to number of the searched elements +1. Thus, excluding memory limitations, one can simulate up to 1024 qubits or more, with only limitation caused by floating point number representations (with larger number of qubits, probability amplitudes after superposition approach to machine zero).

In the case of Deutsch-Jozsa's algorithm simulation, FIG. 56 b shows three simulation approaches. In this case, the direct matrix based approach has the same limitations as in Grover's algorithm, and a PC permits an order up to 11 qubits. With the algorithmic approach, up to 20 qubits or more qubits is possible. The problem-oriented approach with compression gives the same result as in case of Grover's algorithm.

In case of Simon and Shor's quantum algorithms, FIG. 56 c shows different algorithm structure. The matrix based approach and algorithmic approach are shown. The matrix based approach permits simulation up to 10 qubits, and the algorithmic approach permits simulation up to 20 qubits, or more.

FIG. 57 shows analysis of the quantum algorithms dynamics from the Shannon information entropy viewpoint. FIG. 57 a shows the relation between Shannon information entropy of the state vector of the Grover's QSA for different parameters of the data base. This analysis permits estimation of the number of algorithm iterations required for database search regarding database size. This estimation is shown in FIG. 58.

The results of Shannon entropy behavior are presented in the FIGS. 57 b for Deutsch-Jozsa's algorithm, in FIG. 57 c for Simon QA and in FIG. 57 d for Shor's QA.

FIG. 59 shows the screen shot of the Grover's QSA problem oriented simulator with sparse allocation of the state vector. The result of the simulation for 1000 qubits is presented.

FIG. 60 summarizes the above approaches to QA simulation. The high level structure of the quantum algorithms can be represented as a combination of different superposition entanglement and interference operators. Then depending on algorithm, one can choose corresponding model and algorithm structure for simulation. Depending on the current problem, one can choose (if available) one of the simulation approaches, and depending on approach one can simulate different orders of quantum systems.

Although various embodiments have been described, other embodiments will be apparent to those of ordinary skill in the art. Thus, the present invention is limited only by the claims. 

1. A method for simulating a quantum algorithm on a classical computer, comprising: applying a unitary matrix quantum gate G to an initial vector to produce a basis vector; measuring said basis vector, wherein elements of said quantum gate G are computed on an as-needed basis; repeating said steps of applying and measuring k times, where k is selected to minimize Shannon entropy of said basis vector; and decoding said basis vectors, said decoding including translating said basis vectors into an output vector.
 2. The method of claim 1, wherein said quantum gate G describes an entanglement-free quantum algorithm.
 3. The method of claim 1, wherein said elements of said basis vector comprise one of two pre-computed values.
 4. An intelligent control system comprising a quantum search algorithm configured to minimize Shannon entropy comprising: a genetic optimizer configured to construct one or more local solutions using a fitness function configured to minimize a rate of entropy production of a controlled plant; and a quantum search algorithm configured to search said local solutions to find a global solution using a gate G expressing a fitness function configured to minimize Shannon entropy, said gate G corresponding to an entanglement-free quantum algorithm for efficient simulation, and wherein elements of said gate G are computed on an as-needed basis.
 5. The intelligent control system of claim 4, wherein said global solution comprises weights for a fuzzy neural network.
 6. The intelligent control system of claim 4, wherein said fuzzy neural network is configured to train a fuzzy controller, said fuzzy controller configured to provide control weights to a proportional-integral-differential controller, said proportional-integral-differential controller configured to control said controlled plant.
 7. The intelligent control system of claim 4, wherein said fitness function is step-constrained.
 8. The intelligent control system of claim 4, wherein each element of a state vector of said quantum search algorithm comprises one of a finite number of pre-computed values.
 9. The intelligent control system of claim 4, wherein said quantum search algorithm operates on pseudo-pure states.
 10. A method for global optimization to improve a quality of a sub-optimal solution comprising the steps of: selecting a first gate G corresponding to a first quantum process, modifying said first gate G into a second gate G corresponding to a second quantum process; having pseudo-pure states; applying a first transformation to an initial state to produce a coherent superposition of basis states; applying a second transformation to said coherent superposition using a reversible transformation according to said second gate G to produce coherent output states; applying a third transformation to said coherent output states to produce an interference of output states; and selecting a global solution from said interference of output states.
 11. The method of claim 10, wherein said first transformation is a Hadamard rotation.
 12. The method of claim 10, wherein each of said basis states is represented using qubits.
 13. The method of claim 10, wherein said second transformation is a solution to Shrodinger's equation.
 14. The method of claim 10, wherein said third transformation is a quantum fast Fourier transform.
 15. The method of claim 10, wherein said pseudo-pure states are entanglement-free.
 16. The method of claim 10, wherein said superposition of input states comprises a collection of local solutions to a global fitness function.
 17. A method for terminating iterations of a quantum algorithm, comprising: performing an interation of a quantum algorithm to produce a measurement vector; computing a Shannon entropy of said measurement vector; selecting a termination condition from at least one of: a first local Shannon entropy minimum, a lowest Shannon entropy within a predefined number of iterations; a predefined level of acceptable Shannon entropy; and repeating said performing and computing until said termination condition is satisfied.
 18. The method of claim 17, further comprising measuring a final output result.
 19. The method of claim 17, further comprising measuring an output result at each iteration.
 20. A method for intelligent control comprising a quantum search algorithm corresponding to a quantum system on entanglement-free states configured to minimize Shannon entropy comprising: optimizing one or more local solutions using a fitness function configured to minimize a rate of entropy production of a controlled plant; and searching, using a quantum search algorithm to search said local solutions to find a global solution using a fitness function to minimize Shannon entropy.
 21. The method of claim 20, wherein said global solution comprises weights for a fuzzy neural network.
 22. The method of claim 21 further comprising: training a fuzzy controller, providing control weights from said fuzzy controller to a proportional-integral-differential controller, and using said proportional-integral-differential controller to control said controlled plant.
 23. The method of claim 20, wherein said quantum search algorithm iterates until a first local Shannon entropy minimum is found.
 24. The method of claim 20, wherein said quantum search algorithm iterates until a lowest Shannon entropy is found within a predefined number of iterations.
 25. A global optimizer to improve a quality of a sub-optimal solution, said optimizer comprising of a computer software loaded into a memory, said software comprising: a first module for applying a first transformation to an initial state to produce a coherent superposition of basis states; a second module for applying a second transformation to said coherent superposition using a reversible transformation to produce one or more entanglement-free output states; a third module for applying a third transformation to said one or more coherent output states to produce an interference of output states; and a fourth module for selecting a global solution from said interference of output states.
 26. The optimizer of claim 25, wherein said first transformation is a Hadamard rotation.
 27. The optimizer of claim 25, wherein each of said basis states is represented using qubits.
 28. The optimizer of claim 25, wherein said second transformation is based on a solution to Shrodinger's equation.
 29. The optimizer of claim 25, wherein said third transformation is a quantum fast Fourier transform.
 30. The optimizer of claim 25, wherein said fourth module is configured to find a maximum probability.
 31. The optimizer of claim 25, wherein said superposition of input states comprises a collection of local solutions to a global fitness function.
 32. The optimizer of claim 25, wherein elements of a quantum gate are computed on an as-needed basis.
 33. The optimizer of claim 25, wherein a state vector describing said output states is stored in a compressed format. 